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An Approximation for the Variance of the UPCO 
Offered Load Estimate 


By E. J. MESSERLI 
(Manuscript received September 26, 1977) 


This paper develops a generalization of some available approxima- 
tions for the variance of the estimate for offered load to a trunk or server 
group operating in a blocked-calls-cleared mode, using measurements 
of usage, offered attempts (peg count), and overflow. The analysis takes 
into account the peakedness of the offered traffic stream, the level of 
blocking on the group, the duration of the measurement interval, and 
switch count errors due to sampling usage. The resulting approximation 
is quite accurate over a wide range of conditions, is easily computable, 
and clearly displays the role of the basic factors that control the pre- 
cision of the estimator. The variance approximation ts useful in studies 
of the relationship between traffic measurement errors and the per- 
formance of the provisioning and administration processes. 


l. INTRODUCTION 


The estimation of loads offered to a trunk group or server group op- 
erating in a blocked-calls-cleared mode plays an important role in many 
network-provisioning processes. The preferred measurement combi- 
nation for developing such load estimates consists of usage, offered at- 
tempts (peg count), and overflow attempts (usually referred to in the 
Bell System as UPCO measurements). This paper develops a generali- 
zation of some available approximations for the variance of the UPCO 
offered load estimate for a single measurement interval. The analysis 
considers the peakedness of the offered traffic stream, the level of 
blocking or call congestion for the group, the duration of the measure- 
ment interval, and switch count errors due to the sampling of usage at 
discrete points in time. The resulting approximation is quite accurate 
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over a wide range of conditions, is easily computable, and clearly displays 
the role of the basic factors that control the precision of the estima- 
tor. 

Variance approximations are useful in designing measurements and 
in studying relationships between traffic measurement errors and the 
performance of the provisioning and administration processes. For ex- 
ample, the relationship of actual traffic measurement accuracies (which 
can be further corrupted by wiring, data base, and recording errors) to 
the quality of the trunk provisioning process was studied in Ref. 1. The 
variance approximation developed here was useful in quantifying the 
background accuracy of the process. 

This paper is organized as follows. The basic approximation is pre- 
sented and discussed in Section II. The development of the approxi- 
mation is given in Section II]; supporting analysis of switch count error 
is developed in the appendix. Concluding remarks are given in Section 
IV. 


ll. THE BASIC APPROXIMATION 


Figure 1 illustrates UPCO measurements for a measurement interval 
of length T, with usage scan interval s. The UPCO estimate for the offered 
load during this measurement interval is given by 


OVERFLOW 


NO. OF 
TRUNKS 
BUSY 
n(t) 








OFFERED CALLS (PEG COUNT) 





-->| s }e-- USAGE SCAN INTERVAL 


L —-a-—o ore MEASUREMENT INTERVAL = ————-—-— | 


T 


PEG COUNT P=2Z (OFFERED CALLS IN MEASUREMENT INTERVAL) 
OVERFLOW O=2 (OVERFLOW CALLS IN MEASUREMENT INTERVAL) 
1 





= (NO. TRUNKS BUS 
NO. SCANS IN ang ee 


USAGE U= 
(measu REMENT INTERVAL 


Fig. 1—UPCO measurements. 
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average measured usage 


(1) 


G= 
1-measured blocking 


where the measured blocking is the ratio of overflow to offered attempts. 
It is well known that (under reasonable conditions subsequently dis- 
cussed) this is an unbiased estimate for the true offered load a during 
this interval. 

Early work on analyzing offered load estimators was carried out, 
among others, by R. I. Wilkinson,? who addressed the reliability of 
holding time estimates. In a 1952 paper,? W. S. Hayward, Jr., drawing 
on some of Wilkinson’s analysis, addressed the variance of offered load 
estimates based on sampled usage. Hayward’s model assumed Poisson 
arrivals, exponential holding times, and no blocking, yielding the re- 
sult 


varaye " (2+), (2) 


where a is the offered load in erlangs, h is the average holding time, and 
T is the length of the measurement interval. The parameter q is given 


by 


Iter? 
Ler" 


2, (3) 


where v = s/h, and s is the usage scan interval; g determines the variance 
contribution due to switch count (sampling) error, e.g.,q = 0 for s = 0, 
the continuous scan case. 

In more recent work, Hill and Neal* addressed the question of the 
variance of d for peaked traffic,* but did not consider congestion or 
switch count error. Through the application of an asymptotic result for 
the variance of the renewals for a peaked traffic stream, they obtained 
the expression 


Qhaz 
T 2 





(4) 


var(d) = 


where z is the peakedness factor for the stream. 
In this paper, we combine elements of both of these previous analyses 


* Peaked traffic refers to overflow traffic, or to streams containing some overflow traffic. 
The peakedness factor z(u) (or z if u is understood) is the equilibrium variance-to-mean 
ratio of busy servers when this traffic is offered to an infinitely large group of exponential 
servers with service rate ». The peakedness factor is one for Poisson traffic and is larger 
than one for overflow traffic. 
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——— FROM VARIANCE APPROXIMATION, eq. 5 
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Fig. 2—Comparison of variance approximations for N = 10 servers (h = 180s, T = 
3600s,s = 100s). 


and explicitly consider the effect of blocking on the group, to obtain the 
generalization 


var(d) = _ (22 + —— (5) 


where B is the equilibrium call congestion,* i.e., the fraction of attempts 
blocked. Thus, congestion basically adds a term to the previous various 
- approximations. 

Figures 2 and 3 show comparisons of the variance approximation (5) 
with the reference approximations obtained via the error theory devel- 


* The blocking B is defined in theory as the probability that an arbitrary attempt is . 
blocked. In practice, when the load parameters a,z are given, the blocking or call congestion 
B is assumed to be defined by the equivalent random method (Ref. 5), so that B = f(N,a,z) 
where N is the number of trunks in the group. Otherwise, as shown by Holtzman (Ref. 6), 
the blocking B is not uniquely defined by N,a,z, but may take on a range of values, de- 
pending on higher order characteristics of the traffic stream. The actual value of f(N,a,z) 
may be obtained from traffic tables normally used in administering trunking networks. 
It may also be estimated by Hayward’s approximation, f(N,a,z) ~ B(N/z,a/z) (Ref. 7), 
thus allowing Erlang B(.,.) tables or formulas to be used. 
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0.25 


NEAL AND KUCZURA RESULTS (REF. 8) 


FROM VARIANCE APPROXIMATION, eq. (5) 


VAR (a)/a} 


0.2 


NEAL AND HILL FORMULA, eq. (4) 
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Fig. 3—Comparison of variance approximations for N = 40 servers (h = 180s, T = 
3600 s, s = 100s). 


oped by Neal and Kuczura.®" These results assume that h = 180s, T = 
3600 s (i.e., h/T = 0.05), and s = 100s. For a wide range of congestion and 
peakedness conditions, the agreement between eq. (5) and the reference 
results is very good. Neal and Kuczura also determined by numerical 
comparisons that switch count error was a small contributor to var(d). 
Since q is small for typical scan-interval-to-holding-time ratios (e.g., q 
= 0.05 for s = 100s and h = 180s, which are typical scan intervals and 
holding times for Bell System trunks), this conclusion is also evident 
from eq. (5). 

Figures 2 and 3 also show the behavior of the Neal and Hill result, eq. 
(4). As the load per trunk increases, it is clear that the contribution of 
the congestion term in eq. (5) is increasingly important. These higher 
levels of congestion occur quite commonly on high usage groups, where 
a substantial fraction of the busy hour loads may be overflowed to an 
alternate route. As the load is increased to very large values, the coeffi- 
cient of variation using eq. (4) goes to 0, whereas Figs. 2 and’3 suggest 
that the coefficient of variation has a positive limit as a — ©. It can be 
shown that (for any z) as the attempt rate \ > ~, 

lim var(é)/a2 = h/TN, (6) 


A> @ 


* This error theory is applicable to general functions of the e UPCO measurements. The 
approximation developed for the. UPCO offered load estimate is computationally much 
more complex, as well as less transparent, than eq. (5). The Neal and Kuczura approxi- 
a, agreed well with simulation results, and hence is a suitable reference for comparing 
eq. (5). 
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where N is the number of servers in the group.* Equation (6) has a simple 
interpretation. The UPCO offered load estimate may be viewed as the 
product of essentially independent estimators for the attempt rate \ and 
for the mean holding time h. As \ > ~, the coefficient of variation for 
the first estimator goes to 0. Equation (6) represents the squared «vef- 
ficient of variation for the second estimator, i.e., the positive limit results 
from having only a finite number of carried attempts from which to es- 
timate mean holding time. For Figs. 2 and 3, the asymptotic limits for 
the coefficient of variation are 0.071 and 0.035, respectively. 

If a is assumed to have a mean a, and variance o,2, one is often in- 
terested in estimating a,. The results of this section can be applied to 
obtain var(d,) for a single measurement period by interpreting them as 
conditional results, i.e., var(é|a), in the expression 


var(G,) = og2 + Eq var(a|a). (7) 


In many cases, the o,2 term can be a significant contributor. For ex- 
ample, in trunk engineering o,2 may represent a day-to-day variance 
under an i.i.d. model for busy-hour loads (in this case, a, is usually es- 
timated from 5 to 20 busy-hour loads) and can be quite large in relation 
to the other sources of variability. 


iil. DEVELOPMENT OF THE APPROXIMATION 


Consider a full access group of N servers operated in a blocked-calls- 
cleared mode. The offered traffic process is assumed to be a (nonlattice) 
renewal process with rate parameter \, and server holding times are 
assumed to be exponential with hang-up rate ». We define the mean and 
peakedness of the offered load by a = A/y, z = var(n(t))/E(n(t)), where 
n(t) is the equilibrium occupancy when the renewal process is offered 
to an infinitely large group of exponential servers with rate u. The pa- 
rameters (a,z) are conventionally used in traffic engineering, and hence 
it is useful to relate the variance approximation to these parameters: 

For a measurement period of length T, let u,p,o denote average 
measured usage, offered attempts, and overflow attempts, as illustrated 
by Fig. 1. The average measured usage is defined by u = 1/m 2 peer 
n(js) ifs > 0, and byu = 1/T f@n(t)dt ifs = 0, where n(t) is the number 
of busy servers at time ¢. It is assumed that equilibrium conditions apply 
at the beginning of the measurement interval, both for the occupancy 
on the servers and for the renewal processes corresponding to arrivals 
and overflows. 


* This result is not the same as the limit obtained from eq. (5) as a > ~, which gives (1 
+ q)h/TN. The discrepancy arises because the model for switch count error used in the 
development of eq. (5) breaks down as \ — ~, For this unrealistic limiting case, the servers 
are occupied 100 percent of the time, and no error is introduced by scanning. The correct 
result is thus obtained by noting that the carried attempt process approaches a Poisson 
process with rate Nu as \ > ~. 
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The UPCO estimate for the offered load a over the measurement period 
is 





got apt ak (o (8) 
: 1—o/p Pe ate ); 

where c = p — o. Thus, G@ may be viewed as the product of separate es- 
timators for the arrival rate (p/T) and for the average holding time 
(Tu/c). The approximation for var(G) is obtained by introducing an 
approximate treatment of the scanning error, and then by examining 
(8) for large T. However, while the structure of the approximation is 
motivated by asymptotic analysis, the validity of the approximation is 
based on its accuracy for realistic values of T. 


3.1 Treatment of scanning error 


The scanning error for usage affects only the value Tu in (8), which 
may be expressed as 


c 
Tu = dD hj+ro-rr, (9) 
j=l 


where h j is the sampled holding time estimate for the jth call to be ac- 
cepted by the group, A j 40,s,2s,---}, and ro, rr are end effects. In par- 
ticular, if the jth call to be accepted by the group was hit by &; scans, then 
h; = kjs may be viewed as the sampled holding time estimate for this call. 
The variable ro is the total measurement period usage attributable to 
calls already in progress at the beginning of the interval, while ry is the 
total usage due to accepted calls that would be measured in the subse- 
quent measurement period of length T. | 

Throughout this analysis, we make the following simplifying as- 
sumptions: 

(i) h jJ = 1,2,-++,c are independent random variables. 

(ii) h; =h; + e; where e; is the scanning error that results when a call 
with exponential holding time h; begins at a time which is uniformly 
distributed between two successive sampling instants. 

These simplifying assumptions hold exactly for the case B = 0,5 = 
0 (no congestion and continuous scan) and any 2, since all calls are carried 
and the holding times are i.i.d. exponential random variables. They also 
hold exactly for the case B = 0, s > 0, and z = 1, since for a Poisson 
process the arrivals in disjoint intervals are independent. Furthermore, 
given a fixed number of arrivals in an interval (in particular, an interval 
of length s), the arrival times are independent and uniformly distributed 
within the interval. Thus, the simplifying assumptions—while not always 
true—can be rigorously justified for some important cases. In general, 
they can be expected to be reasonable assumptions if the usage on each 
server in the group does not approach unity, i.e., if congestion is not too 
severe. 
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As a result of the simplifying assumptions, the scanning error need 
only be examined for an isolated call. The analysis for this situation is 
treated in the appendix, where it is shown that with e =h —h =ks —h, 
i.e., the sampled holding time minus the true holding time, 


E(e) =0 £10) 
cov(h,e) = 0 (11) 
Aree = 2| A h2q, (12) 
l-e-¥ 


where v = s/h, h = »~1. For s = 0 (continuous scan), var(e) = 0 as ex- 
pected, and hence these results cover both the continuous or the discrete 
scan case. 


var(e) = h2 |» 


3.2 Asymptotic analysis of variance 


Since p corresponds to the arrivals for a renewal process, x + p/T is 
asymptotically normal with mean ) and variance of the form O(1/T) 
(Ref. 9, p. 40). It is established in Ref. 10 that the variance can be ap- 
proximately expressed in terms of the peakedness z 


var(x) = (2z — 1)A/T. (13) 


As noted in Ref. 4, this approximation has been found to be quite good 
fora > z— 1,and T = 10h. Although the carried calls ¢ do not necessarily 
correspond to a renewal process (unless c = p), c/T is also asymptotically 
normal with mean \(1 — B), (where B * limp. .(o/p)) and variance 
O(1/T). This follows since if B > 0 the overflow process o is a renewal 
process, and the carried calls between overflows are independent for 
successive interoverflow periods. The only other asymptotic result 
needed is the following one, the proof of which is essentially the same 
as that for the function of sampling moments theorem given on p. 366 
of Cramér:!! 

If g(.,.) is a twice continuously differentiable function in some neigh- 
borhood of the point A,A(1 — B), then g(p/T,c/T) is asymptotically 
normal with mean g(A,A(1 — B)) and variance O(1/T). It follows that 


E(g(p/T,c/T)) = g(4,A(1 — B)) + O(1/V’T). (14) 


Now for large T, the end effects ro,r7 in (9) can be ignored at the 
outset. In particular, we have E(Tu) = O(T), var(Tu) = O(T), whereas 
E(ro — rr) = o(1), var(ro — rr) = O(1). (In general, ignoring these end 
effects is valid when T/h is reasonably large, e.g., T/h = 10.) Thus, de- 
fining* y = Djay h j/c, where the h; satisfy the simplifying assumptions 
made for handling the scanning error, it follows from (10) to (12) that 


* While y can be defined to be 0 for c = 0, in order to simplify subsequent notation, we 
shall assume that P(c = 0) = 0. This is reasonable even for the typical values of T that are 
of interest in practical applications. 
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E(y) = E(h) =h (15) 
h2(1 + q) 
AT(1 — B) 


where we have used (14) to evaluate E(1/c). 
Turning our attention next to d, we have 


var(y) = var(h)E (:) = + o(1/T), (16) 


d=xy (17) 
var(d) = E(x2y2) — B2(xy). (18) 
In order to simplify this expression, we first note that 
E(y|c) = 


and hence 
E(xy) = Ep E(xy|p,c) = Epc(xh) = = E(x)E(y); (19) 


i.e., x,y are uncorrelated, confirming that @ is an unbiased estimate of 
a. By the same conditioning, we also obtain 


E(x2y?) = h2(E(x2) + (1 + q)E(x2/c)) (20) 

and since 
E(x2)E(y?) = h2(E(x?) + (1 + qg)E(1/c)E(x2)), (21) 
E(x?y2) = E(x?)E(y?) + (1+ q)h?w, (22) 


where w = cov(x?,1/c). Substituting (19) and (22) into (18) and identi- 
fying terms, we have 
var(d) = E2(x) var(y) + E2(y) var(x) + var(x) var(y) + (1 + q)h2w. 
(23) 
By direct substitution of the means and variances for x,y 
ah(1+q) 
T(1 — B) 
It remains to show that w = 0(1/T). But Tw = cov(x?,1/(c/T)) and hence 
by (14) it follows that Tw = o0(1),i.e., w = o(1/T). This completes the 
analysis; the variance approximation given in eq. (5) corresponds to 
terms of O(1/T) in (24). 


var(d) = + (2z —1) + o(1/T) + (1+ q)h2w. (24) 


IV. CONCLUSIONS 


In this paper, we have developed a simple approximation for the 
variance of the UPCO offered load estimate commonly used in offered 
load estimation. This approximation shows clearly the role of source load 
variation, switch count error, peakedness, congestion, and length of the 
measurement period. Relative to previous work, the main contribution 
is the explicit inclusion of congestion. Thus the results are of particular 
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interest for high congestion situations such as occur in measuring loads 
on high usage groups. 

While the basic approximation is developed here for a single mea- 
surement interval, it can be easily applied in analyzing load estimates 
based on the average load over a number of single measurement inter- 
vals. 
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APPENDIX 
Analysis of Switch Count Error 


In this appendix we analyze, using methods similar to Hayward,? the 
following switch count error model: (i) a call with holding time h begins 
at a time uniformly distributed between two successive sampling in- 
stants, (iz) the sampling interval is of length s, (ii) the holding time is 
exponentially distributed with rate parameter y. 

For an arbitrary call, the error e between the true holding time h for 
the call, and the “sampled holding time,” is given by e = ks — h, where 
k represents the scan count for the call, ke{0,1,2,- - }. The scan count for 
the call is simply the total number of scans that occur during the time 
the call is in progress. 

Since e e[—s,s], it is convenient to define a normalized error e’ = k — 
h’, where h’ = h/s is exponentially distributed with rate parameter p’ 
= us = s/h. The density of h’ is therefore given by 


f= |), = pt 
we#E £20 
Define x’ = x/s, where x is uniformly distributed in [0,s] and represents 
the time from a sampling instant to the beginning of a call. Given x’¢[0,1], 
it is straightforward to show that the conditional probability density 
of e’ ate’ =yis 


(25) 


0, ye[-(1 — x’),x’] 
2 [Ree Yard =29e 1. 


The only case for which a negative argument can occur in any term in 
the preceding sum is for k = 0, y > 0. Thus, 


, 


fo] u 


Me 


bu 


o> 
i 


1 ; 
f(k — y) = ——— w’e#’? for y <0. 
0 l-—e7 
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Defining r = e-+’, (26) becomes 








0 x-<y sl 
wleryY O<y <x’ 
1l=?7T 
g(y |x’) = (27) 
p/er'y —(1—x’)Sy<0 
Y ore a 
0 ~l<y<-(1—- x’). 


To simplify obtaining of moments for e’, we define G(a) = E(e%e’) = 
E,E(e*’|x’). Using (27), 


1 x! : 0 : 
G(a) = —— E,, E §, petu’taydy + ff pie randy |. 
Bg 0 —(1-x’) 





(28) 
After integration, one obtains 
f 1l+r vg (e* + re-%) p’ 
GG) S122) a fe a 
ta) a + ;) G _ ~) (u’ + a)? l-r (u’ + a)? 
(29) 
We have G(0) = 1, G’(0) = 0, and 
1t+ril 1 
G’”(0) = == 2 (30) 
LSE (u’)? 
hence, 
E(e)=0 (31) 
—, /Lte7s/h s 
— 2 ————-- == 
var(e) = h? (+52), (32) 


which establishes (10) and (12) of the main section. 
To establish the covariance between h, e, we note that because of (31), 
cov(h,e) = E(he) = s2E(h’e’). But 
E(h’e’) = Ey | Peony 27h — Fle - y)dy | 
—(1-x’) k=0 


, 


=Ey| fot -»ray | 


f 


+Ey| {> bofh—ydy|. (33) 
—(1-x’) k=0 
The first term is — var(e’). To evaluate the second term, we note that 


> kyf(k —y) = X kyp’e“"”4-Y) = yer'Yp’ kr 
k=0 k=0 k=0 
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: 2 ; d/ 1 , 
= yer'Yu’r SY krk-1l = yer'p’r — Game! r=e 7H, 


dr 
Therefore 
Ey | Li & hott - yay | 
—(1-x’) k=0 


r x’ : 


Thus, we are led to define the function 





x’ 
H(a) = Ey if pretwtorvdy |. 
=(l=x1) 
Carrying out the integration yields 


Qu’ pw =et+r2e-a 


aaa (u’ + a)? 7" (u’ + a)? r 


(35) 


The expectation in (34) is now evaluated as 





es MOI Dg dea. ede 
Og ae 3) 


giving 
x’ fe) 1 1 4 r 9 
Ey kvftk — y)d tue 4 ry 


Therefore, E(h’e’) = — var(e’) + var(e’) = 0, i.e., h’ and e’ are uncorre- 
lated random variables and 





var(f) = var(ks) = dash) + var(e). (36) 


Remark: Hayward? treats switch count error and source load variation 
separately, assumes independence, and adds the separate variances to 
obtain an approximate result. He noted that the errors were probably 
correlated, though weakly, and that (at that time) no method to take this 
into account was evident (Ref. 3, p. 363). Since cov(h,e) = 0, it follows 
from this analysis that (for the same model studied by Hayward) the 
errors are in fact uncorrelated. It was also pointed out by the referee that 
an alternate proof that cov(h,e) = 0 can be obtained by noting that the 
scan count k is geometrically distributed for k = 1. Thus, by directly 
evaluating var(ks), one finds that var(ks) = var(h) + var(e), which 
implies cov(h,e) = 0. 
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Within the population of voiceband telephone channels, few channel 
characteristics are as pervasive in their impairment of high-speed data 
communication as nonlinear distortion, which cannot be removed or 
equalized in the receiver as easily as can linear distortion. The purpose 
of this paper is to report on an investigation of a QAM receiver incor- 
porating adaptive equalization of nonlinearities as well as adaptive 
decision feedback equalization and data-aided carrier recovery for 
mitigation of linear distortion and phase jitter, respectively. Non- 
linearities are equalized by adding to the received in-phase and 
quadrature signals a weighted sum of nonlinear functionals of the re- 
ceived signal and of modulated previous receiver decisions. The choice 
of nonlinear terms in the sum is based on a channel model incorporating 
quadratic and cubic nonlinearities as well as linear dispersive elements. 
The adjustment of the weighting, or tap, coefficients for the various 
terms is based on a gradient algorithm, as is the adjustment of the 
linear tap coefficients and the carrier phase reference. The feasibility 
of nonlinearity equalization on real voiceband channels was confirmed 
in a test in which recorded 9600-bps QAM signals, received from a 
worse-than-average set of 17 voiceband telephone channels, were 
processed by a computer-simulated version of the proposed receiver 
(termed the NL receiver). The observed error rates for all channels were 
lower, in some cases by several orders of magnitude, than those achieved 
by computer-simulated versions of the linear receiver and of a decision 
feedback equalization receiver (termed the DFE receiver). 


l. INTRODUCTION 


The prevalence of nonlinearities and their distorting effect on high- 
speed data transmission over voiceband telephone channels has long 
been recognized.! The effect of nonlinear distortion on linearly modu- 
lated data signals is to introduce nonlinear intersymbol interference and 
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reduce the margin against noise. For data rates above 4800 bps, nonlinear 
distortion is the dominant impairment on many voiceband telephone 
channels. Experimental studies have measured nonlinear distortion and 
related the observed error rates for specific modulation formats to this 
and other measured impairments.” Estimation of performance for data 
transmission in the presence of nonlinearities can be carried out* but 
gives little insight into the problem of receiver optimization, except for 
certain simple nonlinear channel models.® 

Recognizing that nonlinearities in transmission channels generally 
coexist with linear elements such as filters, one is led to consider a general 
nonlinear receiver structure, based on a Volterra or Wiener kernel 
characterization® of a general nonlinear system such as that proposed 
in Refs. 5, 7, and 8, the latter in connection with adaptive echo cancel- 
lation. In the present work, we extend this approach by generalizing the 
structure of a passband decision feedback equalizer, previously studied 
in connection with linear channel distortion,? to process nonlinear as well 
as linear functionals of the incoming signal and prior decisions.* 

The new receiver structure is based on a model of a passband channel 
with quadratic and cubic nonlinearities, as well as linear filters. We re- 
port on the simulation of the new receiver and on comparisons of its 
performance with two other previously simulated 9600-bps QAM re- 
ceivers on a worse-than-average set of voiceband telephone channels. 
The new receiver is referred to as the NL receiver. The other two re- 
ceivers, designated LE (linear equalization) and DFE (decision feedback 
equalization), are not designed to compensate for channel nonlinearities. 
Their performance is compared over the same set of voiceband telephone 
channels in Ref. 9. The simulated LE receiver is described in Ref. 10. 


ll. SUMMARY OF THE MAJOR RESULTS 


The relative performances of the three simulated receivers on the same 
set of recorded, received, 9600-bps data signals are briefly summarized 
as follows: On every channel, the NL receiver yielded a lower error 
probability than the other two receivers. For 13 out of the 17 channels, 
the improvement in error rate was equal to or better than about an order 
of magnitude. Another gauge of the degree of improvement offered by 
the NL receiver is the fact that it increased the number of channels 
yielding a better-than-10~‘ error rate from 8 to 15. On one channel, 
whose major impairment was second harmonic distortion, the NL re- 


* Figure 3a summarizes the structure of the nonlinearity-equalizing receiver. 
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ceiver’s error rate bested that of the DFE and LE receiver by over four 
orders of magnitude. Figure 5 is a bar graph summarizing the error rate 
comparisons. 

The apparent attractiveness of the NL receiver structure is, however, 
tempered by its greater complexity. A large number of nonlinear tap 
coefficients is necessary to account and compensate for the dispersive 
nonlinear effects typically encountered on voiceband channels. In the 
simulations summarized above, the LE and DFE receivers each had 32 
complex tap coefficients, but the NL receiver was, roughly speaking, 
comparable in complexity to an LE receiver with 134 complex tap coef- 
ficients. Reducing the number of coefficients in the NL receiver lowered 
its performance margin over the other receivers. Furthermore, the best 
allocation of a fixed number of tap coefficients varied from one channel 
to another. These points are explored more fully in later sections. 

In spite of the greater complexity of the NL receiver structure, the 
performance comparison of the three receivers does indicate the im- 
portance of alleviating nonlinear distortion for high-speed data trans- 
mission. 


lll. THE CHANNEL MODEL 


Obviously, the effect of channel nonlinearities on a passband QAM data 
signal must be understood before a compensating receiver structure can 
be suggested. A general representation of a bandlimited QAM signal is 
as the real part of a complex waveform: 


x(t) = Re | ere ~~ A(n) F(t — nt) |, (1) 


where j = V—1, f; is the carrier frequency, A(n) is a quantized complex 
number representing the information symbol in the nth symbol interval 
(for example, in the case of four-level QAM, the real and imaginary parts 
of A(n) assume one of the four possible values +1, +3), T is the reciprocal 
of the baud, and F(t) is a complex pulse waveform. 

In the case of QAM signals, extraction of the information symbols 
represented by the complex number A(7) requires two receiver outputs, 
which are derived by appropriate operations on both the received 
passband signal and on its quadrature version, or Hilbert transform. A 
phase-splitting filter is used to obtain both in-phase and quadrature 
versions of a voiceband data signal. 

The complex waveform 


X(t) = ef2thet F A(n) F(t — nT) (2) 
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is assumed analytic;!! that is, its spectrum is twice the Fourier transform 
of x(t) for positive frequencies and is zero elsewhere. Furthermore, we 
assume the spectrum is limited on the high side to frequency 2f,. Note 
that the Fourier transform ¥(f) of the complex pulse F(t) is not neces- 
sarily symmetric about f = 0, but it is assumed to be strictly band-limited 
to -f. <f <f;. The Nyquist frequency is 1/2T Hz. Figure 1 shows a 
sketch of ¥(f) and of ¥(f — f.), which is the Fourier transform of 
el2nfet F(t), 

The notion of analytic signals is a notational convenience. The Hilbert 
transform, or quadrature version of a signal u(t), is a linear functional 


of u(t): 
Yi oh, ee) 
u(t) = . dr. 


ab = 7 
It can be shown that there is a unique analytic signal whose real part is 
u(t), and that u(t) is then just the imaginary part of the analytic signal. 
Conversely, any analytic signal comprises some real signal plus j times 
its Hilbert transform. Since QAM systems operate on both in-phase and 
quadrature versions of signals, they are most conveniently represented 
by means of analytic signals. 

The nonlinear receiver structure will be based on the simple nonlinear 
channel model shown in Fig. 2, using the notation of analytic signals. 
Filters 1, 2, and 3 are passband with the same bandwidth as the trans- 
mitted data signal. The filters may include the receiver’s input filter as 
well as the linear response of the channel. The quadratic and cubic 
memoryless nonlinearities with attenuated outputs account for second 


|F (#)| 


0 1/2T fc f 


|F(f—fe) | 


0 f, 2f. f 


Fig. 1—Fourier transforms of |F(f)| and |F(f — c)|. 
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FILTER NO. 1 






INPUT 
DATA 
SIGNAL 


Fig. 2—Model of a nonlinear channel. 


and third harmonic distortion, respectively. Additional impairments 
not shown in Fig. 2 are phase jitter, which implies multiplication of the 
complex received signal by e/*), and additive noise. 

The result of passing the transmitted waveform through the linear 
portion of the channel (filter 1) is an analytic waveform in the form of 
eq. (2). A passband linear!°-!? equalizer (LE) can be used to minimize the 
mean squared error between its output, sampled at times nT’, and a 
reference A(n)e/(2tfenT+4(n)) which is the complex information symbol 
modulated to passband with a receiver phase reference 6(n). In a linear 
receiver, the passband equalizer output is demodulated [multiplied by 
e—j@xfenT+6(n))] and then quantized to yield a decision A(n). A passband 
equalizer configuration which is theoretically more effective in com- 
batting linear intersymbol interference is the passband DFE, described 
in Ref. 9. 

To motivate a receiver structure which is appropriate for nonlinear 
distortion as well as linear distortion, we must consider the analytic 
signals emanating from the quadratic and cubic path elements of Fig. 
De 

It is shown in the appendix that the analytic signal output from the 
model of Fig. 2 is of the form 


R(t) = Uolt) + ef?*etUyy(t) + e~F2rfetUy9(t) 
+ el4tfetUo(t) + es6*fetU s(t), (8a) 


where 


Uo(t) = D2 Alny)A(n2)*Golt — n1T,t — neT) (3b) 


n1,N2 


Uist) = VD A(n) F(t — nT) 


+ SY A(n,)A(ne)A(n3)*Gqy(t — nyT,t — neT,t —n3T) (8c) 


N1,72,N3 
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Urol(t)= SY Alny)*A(ng)*A(n3)Gio(t — niT,t — neT,t — n3T) 


- (3d) 

Uo(t)= SX Al(ny)A(n2)Golt — n1T,t — neT) (3e) 
U3(t)= DY Al(ny)A(n2g)A(ng)Ga(t — niT,t — neT,t — nT), 

(3f) 


where asterisks denote complex conjugates. 

The various U terms are seen to be linear combinations of products 
of complex information symbols A(n), A(n1)A (ng), A(n1)A(n2)A(n3)*, 
etc. Each modulates a harmonic of the carrier wave. The term e/27/ct- 
U1,(t) includes the linear response of the channel to the data signal and 
also a component resulting from cubic distortion. The terms Uo(t) and 
U2(t) result from the quadratic nonlinearity and the terms U19(t) and 
U3(t) result from the cubic nonlinearity. Additional terms would, of 
course, result from the assumption of additional nonlinear elements in 
the model of Fig. 2. The generalization of expression (8) to an infinite 
power series would be a complex passband version of a Volterra expan- 
sion. 


IV. THE NONLINEAR RECEIVER STRUCTURE 


The receiver structure to be studied here includes the passband QAM 
decision feedback equalizer discussed in Ref. 9, plus nonlinear processing 
suggested by the set of eqs. (3). Let Y(n) be the receiver’s complex output 
at time t = nT. This output is quantized to form the decision A(n), which 
equals the original transmitted symbol A(n) if no error occurred. Let 
the demodulator’s phase reference at time nT be 6(n). Let (Wi)}__n 
and {Bj}, be the complex linear forward and feedback tap coefficients 
respectively, and let {R(n)} be the complex receiver input, sampled at 
times nT. Then 


| i M y 
Y(n) = e-JQ@nferT+000)) WO* R(n —k) — & BY* A(n —k) 
k=-N k=1 


+ Ynz(n)e —j(2xfenT+6(n)) (4a) 


where Yyz(n) consists of nonlinear functions of {R(k)} and {A(k)},<n. 
The linear part of eq. (4a) implies a demodulated linear combination 
of 2N + 1 receiver input samples minus a linear combination of M pre- 
vious decisions. 
The nonlinear term Yyz(n) is heuristically suggested by expression 
(3) in the following way: (i) Assume that at time nT the previous receiver 
decisions A(k) = A(k)(k <n) and that they are available to form the 
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nonlinear feedback terms. (ii) In any terms of expression (3) involving 
decisions A(z) not yet made at time n(k = n), replace A(k)eJ27/cekT+4(®) 
by R(R) to form the forward nonlinear terms. The resulting expression 
is 


Yni(n) = > Wire R(n — ky)R(n — ko)* 
1,82 


+ yy WH ng R(n — ky)R(n — ko) R(n — ks)* 


k1,k2,k3 
+ YY Wee ng R(n — k1)*R(n — ke)*R(n — ks) 
k1,k2,k3 
+ > WP, R(n — ki)R(n — ko) 
k1,k2 
+ YY Wrong R(n — k1)R(n — ko) R(n — ks) 
k1,k2,k3 
— efi) BY, A(n — ky)A(n — ko)* 
k1,ke 
>1 
— ei 2nfenT+00))  S°  BYN ng A(n — ky) A(n — ka)A(n — ke)* 
k1,k2,k3 
>1 
— eJ@xfenT-H(n)) > Bil)" A(n — ky)*A(n — ke) *A(n — k) 
k1,ka,k3 
>1 
— ef(AxfenT49)) BE, A(n — ky)A(n — ko) 
ky,k2 


21 
— 7 6 ; k 
eJ (6xfenT+6(n)) oy BP nok 
k1,ko,k3 
21 


x A(n — k,)A(n — ko) A(n — kg). (4b) 


The formidable-looking expression (4b) is a linear combination of 
products of receiver inputs and their complex conjugates, minus a linear 
combination of products of previous decisions and their complex con- 
jugates, modulated by appropriate harmonics of the carrier. 

Figures 3a and 3b are block diagrams of the NL receiver. The cross- 
hatched boxes in Figure 3a show the nonlinear processing that has been 
added to the basic decision feedback equalization structure described 
in an earlier paper. 


V. ADAPTATION OF RECEIVER PARAMETERS 
As in the linear and decision feedback equalization receivers, the pa- 
rameters {W}, {B} and @ are adjusted in an estimated gradient algorithm 


to minimize the average value of the squared error magnitude |E(n)|? 
defined by 
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Fig. 3—(a) Basic structure of the NL receiver. (b) Details of the nonlinear signal processing. 
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E(n) = Y(n) — A(n). (5) 


The error E(n), as in the previous receivers, is a linear function of the 
parameters {W} and {B}; consequently, the expression for |E(n)|? is 
convex in these parameters. 

In writing the updating equations for the {W(n)} and {B(n)} coefficients 
and for 6(n) in the nth symbol interval, it is convenient to use the symbol 
e(r) to denote the observed passband error after the decision A(n) has 
been made: 


e(n) = [Y(n) — A(n) ei @rfenT+9(n)), (6a) 


thus, if A(n) = A(n), |E(n)|? = |e(n)|?2, and the expression for the gra- 
dient of |«(n)|? with respect to each parameter determines an adjustment 
algorithm for that parameter. The adjustment equation for 0(n) is as 
follows: 


F 2% = alm|e(n)*Z(n)] 
O(n + 1) = O(n) AM)? ; (6b) 
N 
where Z(n) = > WW* R(n —k) 
k=—N 
+ > Wi, R(n — ki)R(n — ko)* 
k1,k2 
+ ‘ 2 Wy kg h(n — ky)R(n — ko)R(n — ks)* 
1L,22,R3 
+ YD Whe eg R(n — k1)*R(n — ko)*R(n — ks) 
ki,k2,k3 
a > WP, R(n — ky)R(n — ke) 
an ; a Wi o,kg h(n — ky)R(n — ko)R(n —k3) (6c) 


is the sum of all the forward terms comprising Y(n). The adjustment 
equations for the {W} and {B} coefficients are as follows: 


WH,.(n + 1) = We. (n) — Boe(n)*R(n — ki)R(n — ko)* (6d) 
WLP (n + 1) = WIP (n) — Bie(n)*R(n — k) (6e) 


WEY, ko (n + 1) = Wilks (n) ak Brure(n)*R(n = ky) 
-R(n — ko)R(n —k3)*  (6f) 


WH),.ks (n + 1) = Wie (n) “ar By2e(n)*R (n r k1)* 
-R(n — ko)*R(n — ks) (6g) 


Wen. (n + 1) = WE, (n) — Boe(n)*R(n — ky)R(n — ke) (6h) 


WE roks (n + 1) a a k3 (n) ~~ B3e(n)*R (n ~e ky) 
-R(n — ko)R(n —k3)  (6i) 
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Be ng (n + 1) = BY, (n) + yoe(n)*A(n — ky)A(n — kg)*ei4™) — (6j) 
BE? (n + 1) = BP (n) + yie(n)*A(n — R)eI@rfen T+") (6k) 


BE ks (n+1)= BE a ks (n) 7 
+ y11e(n)*A(n — ky)A(n — kg) A(n — kg) *eI2fenT+0))— (6]) 


Bev ia (n+ 1) = By BY 2,k3 (n) : 7 
+ yy9e(n)*A(n — ky)*A(n — ke)*A(n — kg)e J 2rfenT+(n)) (6m) 


BP eg (n + 1) = BE py (n) + Yoe(n)*A(n — ky) 
-A(n — ko)ei(4tfenT+0(n))  (6n) 


BY hokg (Nn +1) = BED aks (nr) : : 
+ y3e(n)*A(n — ky)A(n — ko)A(n — ka)etOrfenT+4(n)), (Go) 


The set of eqs. (4) through (6) defines the structure of the nonlinear 
QAM receiver that has been simulated. The a, 8, and y parameters are 
positive constants, chosen to ensure reasonably fast convergence and 
stability in the presence of noise. To enable compensation of rapidly 
varying phase jitter, the phase tracking constant a was set to the rela- 
tively large value of 0.4. The other constants chosen were: 


61 = ¥1 = 0.001, Bo = Be = Yo = y2 = 0.75 X 107°, 
Bi = Bie = B3 = Y11 = Y12 = Y3 = 107%. 


A judicious choice must be made for the range of coefficient indices 
ky, Ro, and kg in the nonlinear terms making up Ynyz,(n), if the total 
number of {W} and {B} coefficients is to be reasonable, say on the order 
of 100. Obviously, the best choice of indices for a fixed number of taps 
depends on the channel. Trial and error (by no means exhaustive) of 
various sets of indices used in simulations on several voiceband channels 
led to the choice of terms shown in Table I. There are 73 “forward” tap 
coefficients {W}, of which 22 are linear, and 61 “feedback” tap coefficients 
{B}, of which 10 are linear. Note that the nonlinear forward tap indices 
are confined to the range —1 < k < 1 and the nonlinear feedback tap 
indices have been confined to the range 1 <k < 3. 


VI. THE SIMULATIONS 


The nonlinear QAM receiver structure described in the previous section 
was simulated on an IBM 360 computer to process recorded 9600-bps 
QAM data signals that had been received from 17 voiceband telephone 
channels. The simulation effort was an extension of that described for 
linear and decision feedback QAM receivers in Refs. 3 and 9, respectively. 
The set of recorded QAM signals was the same, permitting the perfor- 
mance of all three receiver types to be compared under identical con- 
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Table | — Index terms used in voiceband simulations 


Indices Indices 
Terms ky ke ks Terms ky ko kg 

WP? no —1 = BD: 1 1 
0 0 2 2 

1 1 3 3 

—1 0 2 1 

0 —1 1 2 

0 1 3 2 

1 0 2 3 

—1 1 3 1 

1 —1 1 3 


W\Y (Linear) terms ~—12 to 9 inclusive Bi (Linear) terms 1 to 10 inclusive 


Weds ks Bae and 
and WOrRo ks —l ~~ —1 By cho.ks 1 1 1 
0 0 0 2 2 2 
1 1 1 3 3 3 
-1 -1 0 1 1 2 
-1 0 -1 1 2 1 
0 0 -1 2 2 1 
0 -1 0 2 1 2 
0 0 1 2 2 3 
0 1 0 2 3 2 
1 1 0 3 3 2 
1 0 1 3 2 3 
-1 0 1 1 2 3 
0 1 -1 2 3 1 
~1 1 0; 1 3 2 
Week 1 —1 BY) re 1 1 
-1 0 2 2 
0 0 3 3 
1 J 1 2 
0 1 2 3 
-1 1 1 3 
WE noes —1 = a Be aks 1 1 | 
0 0 0 2 2 2 
-1 —1 0 1 1 2 
0 0 -1 2 2 1 
1 1 1 3 3 3 
0 0 1 2 2 3 
1 1 0 3 3 2 
-1 0 1 1 2 3 





ditions. The set of 17 channels could be described as “‘worse than aver- 
age.” Every channel had at least one impairment equal to or worse than 
the 90-percent point on the nationwide toll connection survey.” 

The transmitted QAM signals had been generated digitally, with two 
pseudorandom four-level information symbol streams in quadrature, 
each repeating after 256 symbols. Each quadrature pair of symbols 
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therefore conveyed four information bits and the symbol rate was 2400 
bauds, making a total bit rate of 9600 bps. The carrier frequency f, was 
1650 Hz, and the double-sideband baseband pulse signal had 12 percent 
roll-off. 

The received signals that were recorded in digital form (12-bit samples, 
24-kHz sampling rate) were received from a variety of real and analog- 
simulated voiceband telephone channels in tandem with an actual 50- 
km, C2-conditioned, N2-carrier voiceband channel. 

As in the simulation of the linear and decision feedback receivers, the 
adaptive passband signal processors [defined by the set of eqs. (4) and 
Table I] were preceded by a pair of fixed digital filters that split the in- 
coming signal into in-phase and quadrature components. Each was 
sampled at time instants t = 7 + nT (n = 0,1,2,---). Each simulation was 
actually of five separate receivers in parallel, with sampling epochs 7 = 
0, 0.2T, 0.4T, 0.6T, and 0.8T. The results reported in this paper are in 
each case for the timing epoch which yielded the best performance. As 
noted previously in Ref. 9, the decision feedback structure generally 
produced a relatively small performance spread between the best and 
the worst timing epochs. The receiver’s decisions A(n) were formed by 
quantizing each equalized demodulated output, in-phase or quadrature, 
into one of the four possible levels £1,.+3. 

Before tabulating the simulation results, we mention some qualitative 
observations. In the interest of reducing the large numbers of nonlinear 
coefficients, it would have been desirable that only a few of the observed 
coefficients be large enough to be significant for all the channels. Un- 
fortunately, this was not the case; no pattern was discernible common 
to all channels of a significant subset of coefficients; typically, the non- 
linear component Yyz(n) in the receiver’s output consisted of a large 
number of small terms, rather than a small number of relatively large 
terms plus insignificant terms. 

Another qualitative observation was that the best values for the ad- 
aptation parameters for the nonlinear coefficients were so small that 
convergence of the nonlinear tap coefficients required at least 2000 
symbol intervals, much slower than the convergence rate of the linear 
coefficients. This is attributed to the high correlation among many of 
the nonlinear terms. For example, the term |A,,|2Az, is positively 
correlated with the linear term Ax,, since | Az,|? takes only one of the 
three possible positive values 2, 10, or 18. Under such circumstances, the 
A matrix which describes the correlations among all the terms is ex- 
pected to have a rather large eigenvalue spread, necessitating small 
adaptation constants and slow convergence.!8 

During each run, after an initial training period of 2000 symbol in- 
tervals to allow the coefficients to converge to nearly stationary values, 
the simulated receivers switched to a decision-directed mode in which 
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their decisions A(n), right or wrong, were used in the adaptation and 
decision feedback operations. Since the true transmitted information 
stream {A(n)} was known, the performance was measured by observing 
the number of decision errors made during 7000 symbol intervals (or 
28,000 bits). The empirical probability of the sampled analog error Y(n) 
— A(n) was also measured, and if no errors were observed during a run, 
the error probability could be roughly estimated by extrapolating the 
tail of this distribution, using a computer subroutine by S. B. Wein- 
stein.14 The tabulated error probability, p., is the probability that a 
four-level symbol is in error; i.e., it is roughly twice the bit error rate. 
Another tabulated measure of performance was the output SNR, defined 
by 

(JA(n)|?) 

(|E(n)|?) 


where “( _)” denotes the time average. 


output SNR = 


Vil. QUANTITATIVE RESULTS 


The simulation results for the NL receiver are tabulated in Table II 
along with the corresponding results taken from Ref. 9 for the LE and 
DFE receivers. For each channel, Table II lists the measured impairments 
and the error probabilities (either observed or extrapolated) for the LE, 
DFE, and NL receivers. The quantity in parentheses below each error 
probability is the output SNR in decibels. Error rates below 10-5 were 
extrapolated; in some cases in which the tail of the empirical probability 
distribution of the quadrature components of E(n) was markedly non- 
Gaussian, the extrapolation yielded limited accuracy. Figure 4 illustrates 
the nonlinear compensation for channel 14, which had unusually severe 
second-harmonic distortion. Figure 4 is plotted on a “probability scale;” 
i.e., a Gaussian error distribution function would plot as a straight line 
on it. The distribution function for the linear receiver has distorted tails, 
indicating the presence of residual nonlinear distortion. However, the 
curve is nearly straight for the NL receiver, indicating that nonlinear 
distortion components have been substantially removed. 

Comparison of error rates for the three receivers on all the channels 
is displayed more dramatically by the bar graph of Fig. 5. In all cases, 
the performance of the NL receiver surpassed that of the other two re- 
ceivers. (Note that measurable nonlinear distortion was observed on all 
the channels.) In most cases, the NL receiver afforded a greater im- 
provement in error rate over the DFE receiver than did the DFE receiver 
over the LE receiver. This is a very significant point. It indicates that if 
9600-bps voiceband modems are to be improved by more sophisticated 
signal processing at the receiver, it is more fruitful to attempt to over- 
come nonlinear distortion than to concentrate on more sophisticated 
receiver structures, optimal for linear channel models. 
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’ Error rates 


Slope (dB) 

Signal-to- 
noise 
ratio (dB) 

Second 
harmonic 


Measured 
Impair- 
ments 


harmonic 
(dB) 
Phase jitter 
(peak-to- 
peak) 
Linear 
equaliza- 
tion 
Decision 
feedback 
equaliza- 
tion 
Nonlinearity 
equaliza- 
tion 


(output 
SNR) 


Table Il — Experimental comparison of LE, DFE and NL receivers 
(facility in tandem with Holmdel-Murray Hill N2-carrier line) 


No.5 
none 


0 
29.0* 


33.5 
44 
<3° 
1 x 1078 
(28.0 dB) 
2x 107-9 


(27.8 dB) 


2x 10711 
(29.4 dB) 


No. 6 


Private N 
Carrier to 


White 
Plains 


2 
22.5% 
28* 
31* 
<3° 
2x 10-6 
(22.4 dB) 
4x 10-6 


(22.8 dB) 


1X 10-6 
(24.9 dB) 


No. 7 
Line 
Simulator 


9 
31 


1x 10-5 
(23.7 dB) 


No. 8 
Private T1 No.9 
Carrier to Line 


Newark Simulator 


4.4 -2 

27* 30 

35 251 

40 33 

<3° 14° 

(120 Hz) 
1x10-5 7x 1073 
(20.8dB) (15.0 dB) 
4xX10-§ 9x1073 
(21.2dB) (15.0 dB) 
7X10-7 2x10-4 
(22.3 dB) (18.2 dB) 


No. 10 
Line 


Simulator 


3 
35 


32.1* 


47 


17°F 


(50 Hz) 


8 x 1076 
(23.9 dB) 


5x 1077 
(23.8 dB) 


3X 107-8 
(24.7 dB) 


No. 11 
Line 
Simulator 


11* 
24.4* 
28.6* 


36.4 


3X 1077 
(21.7 dB) 


No. 12 
Line 
Simulator 
11* 

33 

33.8 
49 
<3° 
3 xX 1077 
(22.6 dB) 
7X 10710 
(24.6 dB) 


3X 1071? 
(28.6 dB) 


No. 13 
Line 
Simulator 


0 
34 


34.4 
30.7* 
<3° 
3 x 10-6 
(23.8 dB) 
3x 1078 


(23.9 dB) 


1 xX 107-12 
(27.5 dB) 
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Table II (cont) 





Measured 
Impair- 
ments 


Error Rates 
(output 
SNR) 


Slope (dB) 

Signal-to- 
noise 
ratio (dB) 

Second 
harmonic 


harmonic’ 
(dB) 
Phase jitter 
(peak-to- 
peak) 
Linear 
equaliza- 
tion 
Decision 
feedback 
equaliza- 
tion 
Nonlinearity 
equaliza- 
tion 


No. 14 
Line 


Simulator 


0 
31 


20.6" 

49 

<3° 
2X 1074 
(19.4 dB) 
8 x 1075 


(19.5 dB) 


2x 107-9 
(26.8 dB) 


No. 15 
Line 


Simulator 


0 
31 


27.2* 

32* 

<3° 
1x 1076 
(24.5 dB) 
5 xX 1077 


(24.5 dB) 


1 X 10710 
(28.4 dB) 


No. 16 
Line 
Simulator 


12? 
23t 


25.2 
30.3 
15°t 


(120 Hz) 


1.7 X 107? 
(14.0 dB) 


3 xX 107-2 
(14.1 dB) 


1 X 1072 
(15.6 dB) 


No. 17 
Line 


Simulator 


11.1* 
29* 


32.2* 


34,7* 


10°? 
(120 Hz) 


3x 1073 


(17.4 dB) 
2x 1073 
(18.6 dB) 


3 x 1075 
(20.8 dB) 


No. 18 
DDD 


Loopback 
to Dallas 


7.8 
29* 


1x 1078 
(18.5 dB) 


9x 1074 
(18.3 dB) 


5 X 1075 
(20.4 dB) 


No. 19 
(Private T1 
Carrier 
to Newark 
13t 
28* 

31.8* 

37 

<8? 
2.1 X 1073 
(18.3 dB) 


2x 1073 
(19.9 dB) 


No. 20 


Private T1 
Carrier 
to Newark 


6 
24.8* 


24.4 
32.6* 
<3° 

5 X 1074 


(18.4 dB) 


3X 1074 
(18.7 dB) 


No. 21 
Private T1 
Carrier 
to Newark 


8 
23.21 
24.6t 
28.6t 

<3° 

1.6 X 10-3 

(17.8 dB) 
3x 10-3 


(18.0 dB) 


7x 1075 
(19.6 dB) 





* Indicates worse than 90-percent point in the nationwide toll connection survey. 
t Indicates worse than “worst case” 3002 channel impairment limit. 
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Fig. 4—Comparison of distribution functions of the receiver output Y(n) for the linear 
and nonlinear receivers (data from channel 14). 


Note in Fig. 5 that, for some of the channels, the nonlinearity equali- 
zation reduced the error rate by two or three orders of magnitude. 
However, on other channels, such as 9 and 16* which had most of their 
impairments in the “severe” category, the error rate was high and the 
NL receiver afforded very little improvement. 

An interesting statistic that can be gleaned from Fig. 5 concerns the 
ability of the NL receiver to increase the number of channels which yield 
error rates below a specified maximum. For example, 15 of the 17 
channels yield an error rate of better than 104 with the NL receiver, but 
only 8 of 17 meet this error rate standard with the LE receiver. For a 
maximum error rate of 10-5, the number of channels is 10 with the NL 
receiver and 7 with the LE receiver. For a maximum error rate of 10-§, 
the numbers of channels are 9 and 3 with the NL and LE receivers, re- 
spectively. 

The price paid for the better performance of the NL receiver is, of 
course, its increased complexity, measured by the number of terms 
comprising Yyz,(7n) in eq. (4) and its slower convergence. The effect of 
reducing the number of terms, and therefore the complexity, is treated 
in the next section. 


* Channel 16’s impairments, produced by a line simulator, were all ‘‘worst case” 
values. 
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Fig. 5—Comparison of error rates for the three receivers. 
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Vill. MODIFICATIONS OF THE NONLINEAR RECEIVER STRUCTURE 


8.1 Reductions of the number of nonlinear tap coefficients 

(i) The tap coefficients {W{2?, ,,,| and {B}}?}, ,,} were set to zero, re- 
ducing the total number of nonlinear forward and feedback taps to 37 
each. The measured output SNRs for most of the channels were slightly 
less than those for the full complement of 51 forward and 51 feedback 
taps, as illustrated in Table III. 

(it) A different set of 100 nonlinear terms was created by eliminating 
all cross-product terms and extending the time span covered by the 
forward and feedback terms to 10 symbol intervals. Thus, the forward 
tap coefficients consisted of {W{)}, {WEP} (WEP, (W, (ay, and {Wi{lp) 1 
where —5 <k < 4, and the feedback terms consisted of {B{°}}, Bub, 
{BE Pe}, {BEL}, and (B ®).,}, where 1 < k < 10. Some resulting output a 
are tabulated in part (iii) following. 

(iit) A smaller set of nonlinear taps was created by taking a subset of 
46 of the original set of 102 nonlinear taps. The resulting output SNRs 
for several channels are shown in Table IV, along with the corresponding 
set of SNRs from the original NL receiver structure with 102 nonlinear 
taps and also from the receiver with 100 nonlinear taps, described in item 
(ii), above. 

The results of items (7), (iz), and (iii), compared with the original re- 
sults using the NL receiver with 102 nonlinear tap coefficients indicate 
that a large number of nonlinear correction terms is necessary to yield 
substantial performance improvement. Undoubtedly, still better per- 
formance would have been attained by using more than 102 nonlinear 
taps. The results of item (ii) also showed that elimination of the cross- 
product terms degraded performance, even though the remaining non- 
linear terms encompassed a longer time span. 


Table Ill — Output SNR (dB) for nonlinear receivers 
Channel 102 nonlinear taps 74 nonlinear taps 
9) 29.4 29.4 
6 24.9 23.8 
7 23.7 22.7 
8 22.3 22.1 
9 18.2 17.5 
10 24.7 24.5 
11 21.7 21.5 
12 28.6 27.5 


Table IV — Output SNR (dB) for nonlinear receivers 


Original 
Channel (102 Taps) (it) 100 Taps (tit) 46 Taps 
9 18.2 17.7 17.5 
13 27.5 25.8 25.7 
14 26.8 23.7 23.0 
15 28.4 25.6 26.5 
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(tv) The number of nonlinear taps was also reduced to 46 by elimi- 
nating all coefficients {W!),.}, [BP no}, {WE xorg}, and {BE no 2}. The re- 
sulting output SNR on channel 14 was only 21.3 dB, as compared to 24.8 
dB for 102 nonlinear taps. Thus, it appears that at least the last four sets 
of coefficients (associated with second and third harmonics of the carrier 
frequency) are significant and should be retained. 


8.2 A variation in the receiver structure tested for channel 20 


The forward nonlinear tap coefficients weight various quadratic and 
cubic products of the sampled received signals. One might speculate that 
if linear distortion were removed from the received samples before their 
nonlinear processing, the nonlinear distortion remaining in the output 
might be further reduced. Accordingly, we simulated an NL receiver 
structure which was the same as that shown in Fig. 4 except that there 
are no linear feedback taps and the input to the forward nonlinear taps 
comes from the output of the linear forward taps instead of directly from 
the phase splitter. Since the adaptive linear forward taps, constituting 
the passband equilizer, are in tandem with the adaptive nonlinear taps 
in this structure, the mean squared error is not a convex function of the 
nonlinear tap coefficients, and hence the question of convergence is more 
complicated. Nevertheless, this structure was simulated on channel 20. 
The resulting output SNR was 20.0 dB compared to the 20.6 dB obtained 
from the original receiver structure. Thus, prior linear equalization did 
not appear preferable. 


IX. CONCLUSIONS 


The simulations have demonstrated that nonlinearity-equalizing QAM 
receivers can provide substantially better performance than can con- 
ventional linear or decision feedback equalization receivers over a variety 
of voiceband telephone channels. This encouraging result may stimulate 
further research aimed at finding less complicated receiver structures 
for overcoming channel nonlinearities. 

The number of nonlinear terms that can be considered for inclusion 
in the NL receiver’s analog output Y(n) is potentially enormous. For 
example, the number of different terms R(k1)R(k2)R(k3)* for all indices 
ky, Ro and kg between —N and +N is (2N + 1)2(N + 1), which is much 
more than (2N + 1), the corresponding number of linear terms {R(k)} 
in that range of indices. The simulation results indicated that inclusion 
of a large number of nonlinear terms, including “cross-product” terms 
for which k; ¥ ko ¥ kz, may be necessary. Reductions in the number 
of terms and a variation of the NL receiver’s structure, in which adaptive 
linear processing preceded nonlinear processing, resulted in worsened 
performance. 

Perhaps the major conclusion to be drawn concerns means for im- 
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proving the reliability of high-speed data transmission over the popu- 
lation of voiceband telephone channels. The simulations reported in Ref. 
9 showed that decision feedback equalization, which is known theoret- 
ically to be superior to linear equalization in overcoming severe linear 
distortion, only moderately bettered the error rate obtained with linear 
equalization, especially on voiceband channels meeting C2 conditioning 
standards. However, the results summarized by Fig. 5 indicated that 
there is more to be gained by mitigating nonlinear distortion than in 
using more elaborate methods (beyond linear or decision feedback 
equalization) of mitigating linear distortion. 


APPENDIX 


In this appendix, we derive the form of the analytic signal that emerges 
from the summed filtered outputs of the quadratic and cubic non- 
linearities. The real and imaginary parts of this analytic signal will then 
be the in-phase and quadrature components, respectively, of the non- 
linearly distorted received QAM signal. The following theorems, proven: 
in Ref. 11, will be required: 


Theorem 1: Given real waveforms u(t) and v(t), defined on —~ <t < 
© with respective Hilbert transforms u(t) and U{t), the convolution 


w(t) = fe ijuk—pdr (7) 
has Hilbert transform 
w(t) = f. “Sue =aer= f * SORE = Der (8) 


Thus, if v(t) is the input to a filter whose impulse response is u(t) , the 
analytic output signal is 


w(t) + j(t) = f.. (v(r) + j0(r))u(t — 1dr 


A f. " v(r)(u(t — 7) + ju(t — 7))dr. (9) 


Theorem 2: The analytic signal resulting from the convolution can also 
be expressed as 


w(t) + jub(t) = 5 f. (v(r) + j0(r))(u(t — 7) + jut — 7))dr. (10) 
Now we consider an analytic signal of the form 


X(t) = effet 5 A(n)F(t — nT), (11) 
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as in expression (2) of the text. The squaring and cubing elements in Fig. 
2 operate on x(t), the real part of X(t). The response of the squaring 
element to Re(X (t)) can be written 


x(t)? = = Re | ester > A(n,)A(na) F(t — ny T)F(t — na?) | 


n1,N2 


“++ = | > A(n,)A(no)* F(t = nyT)F(t — na) | ‘i (12) 
2 ni,n2 

Of the complex expressions in square brackets in (12), the first is 
complex and analytic, since it is the square of an analytic signal (its 
spectrum is nonzero only for positive frequencies). Thus, from Theorem 
2, the analytic signal that results from passing the first part of expression 
(12) through a passband filter 2 is of the form 


Unlt) = ef4tfet 3° > A(ny)A(n2)Go(t — ni T,t — neT), (13) 
ny ng 
where Go(t — n1T,t — neT)e/4*fct is a complex analytic waveform, whose 
spectrum has been limited by filter 2 to 0 < f < 2f,. The second term in 
(12) is baseband, real, and not analytic.t However, from Theorem 1, the 
analytic signal resulting from passing the second term through filter 2 
has the form 


Up(t) = S Al(ny)A(ne)*Golt — n1T,t — neT), (14) 


where Go(t — n1T,t — n2T) is an analytic waveform, whose spectrum is 
confined to 0 < f < 2f¢. 

The cubic nonlinear terms are handled similarly. The cube of the input 
signal Re(X (t)) can be written 


x(t)? = {Re | ede > A(n,)A(n)A(n3) F(t — n1T) 


nN1,N2,NZ8 
-F(t — neT) F(t -— ns?) | 
+ : ei2nfet 5 A(m)A(no)A(ns)*F(t — ny T)F(t — no) 
N1,N2,N38 


F(t = ngT)* + > e~V2atet ¥  A(ny)*A(n9)*A(n3) 


n1,722,N3 
»- F(t —n,T)*F(t — noT)*F(n —ngT). (15) 


The first term in square brackets (15) is analytic, being the cube of an 
analytic signal. The other two terms in (15) are not analytic, since their 


t The ranges of the indices n1 and ng in (12) and (13) are assumed to be the same. 
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Fourier transforms are not necessarily zero for negative frequencies. The 
analytic signal resulting from passing x(t)? through bandpass filter 3 can 
be written by applying Theorem 2 to the first term of (15) and Theorem 
1 to the second and third terms. The resulting analytic signal is the sum 
of three analytic signals, U3(t), Ui,(t), and Uy2(t), which have the fol- 
lowing forms: 


U3(t) = e/6tfet = S~  A(ny)A(n2)A(nsz) 


11,N2,N3 


: G3(t = nyzT,t 7 nol t _ ngT'). (16) 
Uyy(t) = ef@tfet SS A(ny)A(n2)A(ns)* 


N1,N2,N3 


: Guilt om nyzT,t = not = ng’). (17) 


Uyo(t) = eJ2rfet SA (ny)*A(ng)*A(n3) 


n1,N2,n3 


. Gyolt = n1I,t = nol ,t = ng3T'). (18) 


The G(_) signals are complex, and the spectra of the analytic signals 
U3(t), U1,(t), and Uy9(t) are all confined to the range 0 < f < 2f, by 
bandpass filter 3. 
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Spectral Sharing in Hybrid Spot and Area 
Coverage Satellite Systems via 
Channel Coding Techniques 


By A. S. ACAMPORA 


(Manuscript received December 1, 1977) 


Multiple spot-beam switching satellites employing frequency reuse 
are considered, and a method for incorporating an area coverage beam 
to provide service to those regions not covered by the footprint of any 
spot beam is proposed here. The method consists of employing a con- 
volutional code for the area beam transmission to enable sharing of a 
common spectral band among the spot and area beams on a noninter- 
fering basis and with no sacrifice in the capacity of the spot beams. A 
maximum-likelihood algorithm for this purpose is derived, and bounds 
on the bit error rate performance of all beams are found. Results show 
that excessive performance degradation arising from cochannel in- 
terference is limited to a thin annular ring surrounding each spot 
beam. 


I. INTRODUCTION 


Multiple spot beam communication satellites offer the potential for 
greatly increasing the traffic handling capability relative to wide-area 
coverage systems, since the allocated spectral band can be reused in the 
various spot beams.!? A high-level block diagram of the satellite tran- 
sponders for such a system might appear as shown in Fig. 1. Here, the 
various service regions are interconnected via an on-board switching 
matrix operating in the time-division mode, and digital modulation 
techniques consistent with time-division multiple access (TDMA) are 
employed. 

As previously noted,? such a system suffers a serious drawback in that 
a large blackout region, serviceable by none of the spot beams, is created. 
The situation is depicted in Fig. 2, which shows the radiation footprints 
of a hypothetical 11-beam private line system serving the large popu- 
lation regions in the United States. Although most of the traffic load for 
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Fig. 1—Satellite transponder. 





Fig. 2—Footprints of a hypothetical 11-beam system showing —1, —2, and —3 dB con- 
tours. Both polarizations are employed. 


such an offering would be adequately served by the 11 high-capacity spot 
beams, it is nonetheless desirable to provide service to the outlying 
areas. 

Among the various techniques proposed in Ref. 3 for coping with this 
blackout problem, the method of deploying a channel coded area cov- 
erage beam, in addition to the various uncoded spot beams, appears most 
attractive in that the blackout region is reduced to a thin annular ring 
surrounding each spot beam. This method offers the additional advan- 
tage of reducing the required radiated power for the area coverage beam, 
an important consideration since the gain of the area beam antenna port 
might be 20 dB lower than that of a spot beam port. In this paper, we 
review the principles involved in this approach and derive bounds on 
the resulting bit error rate performance of both the spot and area 
beams. 

In Section II, we discuss the problems associated with sharing a 


2614 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1978 


common spectral band between area coverage and spot coverage satellite 
beams. Section III is devoted to the derivation of a detection algorithm 
for such a hybrid system in which convolutional coding is used to alle- 
viate the effects of cochannel interference, and bit error rate bounds are 
found. In Section IV, these results are applied in a typical communication 
satellite scenario. 


ll. PROBLEM DEFINITION 


Consider a satellite system consisting of M spot-beam transponders 
serving M geographically separated, high-traffic demand areas on a 
noninterfering basis. The allocated spectral band is totally reused in the 
M spot beams. We wish to deploy an area coverage beam, in addition to 
the M spot beams, to provide service to the low traffic demand outlying 
regions serviced by none of the spot beams. The total traffic demand to 
all outlying regions might be of the same order of magnitude as the de- 
mand for one spot beam. Service to the outlying regions must be provided 
on a noninterfering basis and with no sacrifice to the capacity of the 
various spot beams. We assume that the spot beams require use of both 
electromagnetic polarizations to minimize mutual interference among 
themselves. 

Four types of interference are readily identified: 

(t) Down-link: The area coverage radiation is detectable at every spot 
beam receiving terminal and can thereby interfere with reception of the 
desired signal at those ground stations. 

(ti) Down-link: The spot beam footprints might typically be useful 
out to their —3 dB radiation contours. Area-coverage receiving terminals 
located at the —3 dB through the —20 dB contours of any spot beam 
thereby suffers interference from that spot beam. 

(tit) Up-link: All up-link transmissions from spot-beam earth ter- 
minals are detectable at the antenna port of the area coverage beam and 
thereby interfere with reception of the area coverage up-link transmis- 
sion. 

(tv) Up-link: Transmission from an area coverage ground station 
located between the —3 dB and the —20 dB contour of a spot-beam an- 
tenna pattern could interfere with that spot beam’s up-link transmis- 
sion. 

Thus, the inclusion of an area beam might make the original spot 
beams totally unusable. To eliminate these interference problems, one 
might split the allocated spectral band into two components; one seg- 
ment would be dedicated to the area coverage transponder and the 
second segment would be reused among the various spot beams. If this 
is done, the system designer must choose one of two options: 

(1) Reduce the throughput of the spot beam transponders by that 
fraction of the satellite band dedicated to the area coverage beam. 
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(it) Maintain the original throughput of the spot beam transponders 
while increasing the effective radiated power on both the up-link and 
the down-link to overcome the degradation caused by excessive band- 
limiting. . 

Option (i) results in a sizable decrease in the overall capacity of the 
satellite. Consider a 10-spot-beam system with each beam occupying the 
entire spectral band. The normalized throughput of such a system is 
defined to be 10 units. Suppose that two area coverage beams are added 
to the system (one using each polarization) and that one-half the band 
is reserved for the spot beams. Then, under option (i), the normalized 
throughput is reduced to 44 X 10 + % X 2 = 6, and the overall system 
throughput is reduced by 40 percent. For the same fractional split of the 
total bandwidth, option (ii) could incur a power penalty in excess of 6 
dB for a 4¢-CPSK (coherent phase shift key) system originally operating 
at a modest BT (bandwidth-time) product of 1.3. Such a penalty might 
be acceptable on the up-link, but would typically be unacceptable on the 
down-link since space platform power is a limited resource. 

Thus, to provide service to the outlying area at no sacrifice in either 
the throughput of the spot beams or in the required spot beam effective 
isotopic radiated power (e.i.r.p.), one might consider splitting the band, 
as described above, to eliminate up-link interference. Up-link digits 
would be regenerated, switched, and reformatted into the appropriate 
down-link port. A suitable scheme must then be sought to accommodate 
the down-link. 

Channel coding techniques will be investigated as a possibility. The 
motivation for such an approach is twofold. First, and most important, 
coding can provide for effective immunity against cochannel interfer- 
ence. Second, we note that because of the difference of about 20 dB be- 
tween the antenna gains of the area and spot-beam coverage antenna 
ports, a system would require 20 dB more power for the area coverage 
port than for a spot-beam port to achieve the same bit error rate per- 
formance. Through use of coding, we can effect a considerable reduction 
in the required power for the global beam. 

The scenario envisioned, shown in Fig. 3, would employ uncoded 
transmission for the spot-beam messages and rate r = 44 convolutionally 
encoded transmission for the area beam port. The throughput of the area 
beam port would be one-half that of the spot-beam port, implying that 
the down-link channel symbol rate for all beams are the same. In addi- 
tion, since on-board regeneration is employed, all down-link channel 
symbols can be time-aligned. We will consider 4¢-CPSK modulation and 
explore in detail the situation where the in-phase and quadrature rails 
of both the area and spot coverage beams are modulated separately and 
where there is no crossrail coupling. Thus, we can consider baseband 
performance. The algorithms and other results to be presented are 
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Fig. 3—Regenerative transponder for a hybrid spot-area coverage satellite employing 
channel coding. The allocated band is split between the spot and area beams on the uplink. 
The band is totally reused for all the downlinks. 


readily generalized to the situation where there is a known, fixed carrier 
phase shift between the spot and area transmissions. 


lll. BIT ERROR RATE PERFORMANCE 


We now investigate the bit error rate performance of both the uncoded 
spot beam message and the encoded global beam message in the inter- 
ference-prone region surrounding one of the spot beams. We need to 
consider the presence of only one such spot beam since, in the footprint 
area of that beam, interference from the remaining beams is negligible. 
At a particular ground station, after coherent demodulation, we observe 
the following received baseband process: 


R(t)=VE, x bph(t — kT) + VEo x y, (a)h(t —kT) + n(t). (1) 


In (1) above, b; is the kth member of the binary data stream b of the 
uncoded spot beam message, a represents the binary data stream for the 
global beam, y; (a) is the kth channel symbol of the global beam and is 
dependent upon a through the structure of the encoder, h(t) is the im- 
pulse response of the channel, n(t) is a Gaussian noise process of spectral 
power density No/2, and E, and Ey» are, respectively, the received pulse 
energy of the spot and global beam transmissions. We note that the b;’s 
are independent and equally likely to be +1, and that the y;’s can assume 
the values +1 but are not independent. We assume that intersymbol 
interference is absent. 

A set of sufficient statistics* for detecting the a and b sequences is 
formed by the synchronous samples of R(t) taken at the opening of the 
binary eye. One such sample is: 
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rp = VE, bp + V Eo Ye + Me. (2) 


We assume the various n;’s to be independent. From the samples (2), 
we form the log-likelihood function or path metric* 

A(a,b) = 2 Srp [WE yb, + VEgyn(a)] — C[V Eide + VE oye (a)]? 

(3) 
and decide upon those sequences a,b for which (3) is maximized. 

The maximum-likelihood algorithm to perform optimum detection 
is similar to the Viterbi algorithm? and is illustrated by the state tran- 
sition diagram of Fig. 4, drawn for a K = 3 convolutional code. The state 
is defined by the contents of the first two stages of the shift register, and 
knowledge of the starting state and the next bit entering the encoder 
uniquely determines the next state and the encoded channel symbols 
generated. We note that, unlike the ordinary Viterbi algorithm for rate 
r = 14 codes, each transition between states can occur along four paths, 
rather than one, because two independent uncoded symbols are also 


CHANNEL SYMBOLS 





i : ‘1011 
1000 1001 ‘010 


Fig. 4—-State diagram for maximum-likelihood detection of interfering coded and un- 
coded signals. A K = 3, r = %) convolutional code is assumed. For each transition, the first 
two channel digits correspond to the coded symbols, and the second two correspond to 
the uncoded symbols. 
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generated during each epoch. The first two bits appearing along each 
branch correspond to the encoded channel symbols for that transition, 
and the second two digits correspond to one of the four possible two-bit 
sequences for the uncoded transmissions. 

To perform maximum-likelihood detection, we note that, at each state, 
eight possible branches merge, and the partial path metric of one such 
merging branch must be the largest. The remaining seven paths then 
cannot be most likely because any succeeding additions to any one of 
these seven paths are valid additions to that one path exhibiting the 
greatest partial metric; succeeding additions, then, cannot cause the 
overall metric of any of these seven paths to exceed that of the path ex- 
hibiting greatest partial metric, and the seven paths having the smaller 
path metrics can be deleted from further consideration. 

Thus, at each point in time, the four most likely paths (one leading 
to each state) and their associated partial metrics are known. During the 
next clock cycle, we determine the most likely of eight paths leading into 
each state by performing, for each of two initial states and for each of 
four branches for each initial state, the operation 

1 
An = An-1 + 2» [2ron—k VE \bon-k 
~ V E> Yon-k] [WE ybon—k a V Eoyon-z] (4) 
and saving the path and path metric of the largest for subsequent op- 
erations. The values of bo,-, and yon—p, k = 0,1 to be substituted into 
(4) are determined from the state transition diagram, Fig. 4. 

To perform true maximum likelihood detection, the most likely path 
leading into each state must be stored over the entire past. However, it 
has been shown that after 4 to 5 constraint lengths have elapsed, the 
oldest bits in all path memories are the same with a very high probability. 
Thus, we need to save only the most recent 4K through 5K of data for 
each state and, once in each epoch, the oldest bits in any one of four path 
memories can be outputted as detected data. We note that, unlike the 
ordinary Viterbi algorithm for which each path memory consists of a 
single rail of data, here we need to store three rails of data for each state. 
One rail contains the most likely source sequence a for the area coverage 
beam, and the second two contain the first and second source bits 
emitted each epoch for the uncoded spot beam sequence b. 

The detector will commit an error for the first time at node n if the 
partial metric of some path which previously diverged from the correct 
path and remerges at node n is greater than that of the correct path. 
Some possible error events are shown in Fig. 5. We now calculate the 
probability of such an event. 

Let A correspond to the spot and area coverage information sequence 
along the correct span, and let A be those along the incorrect span. Then, 
the path metric difference is given by: 


SPOT BEAM SWITCHING SATELLITES 2619 


STATE _— ALL—ZEROS PATH 
0000 0000 0000 0000 0000 “ ASSUMED CORRECT 








00 


CASE 2 ~ 





2, 
e 
> 





01 e 


Fig. 5—Select error events for the maximum-likelihood detector. 


A(A) — A(A) = 23¢r,[WEq (by — by) + VEa(yn — 9x)I 
— S[(WE yb, + VEoyn)? — (WE yb, + VEodn)?], (5) 


where the summation is performed over the unmerged span. Substituting 
(2) into (5) and recognizing that b? = 6? = y? = 9? = 1, we obtain 


A(A) — A(A) = A + neq, (6) 
where 
A = 4D [VE bp + VE ox)? (7) 
and 
Neq = 4 one [WE bp + VE ove. (8) 


In (7) and (8), we have used the nomenclature 


~ br if bp = bp 
6, = 9 
: ig otherwise. °) 

Yr Wn =Ye 
= 10 
oe ie otherwise. tO) 


The first event error probability P is equal to the probability that 

A (A) — A(A) <0. From (6) through (8), we conclude that 
4/22 Erbe + VEovp)? 
P= Q| | 
No 

where Q is the complimentary error function. From this result, we now 
derive upper bounds on the bit error rate performance of the coded and 
uncoded transmission. We do the uncoded first. 

Let the unmerged span be L channel digits long. We see from (11) that 
the first event error probability is dependent upon the correct sequence 


(11) 
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along the unmerged span. For each possible error path of length L, we 
will determine the number of uncoded bit errors experienced along that 
path, and average (11) over all possible correct sequences. An upper 
bound on the average bit error rate for the uncoded transmission is then 
given by the summation over all possible incorrect paths, of the product 
of the number of bit errors experienced along a particular path and the 
average probability that the particular path has a metric exceeding that 
of the correct path. 

Let the coded channel bits be different along the correct and incorrect 
paths in D symbols. Let the number of channel symbols for which an 
error occurs for both the coded and uncoded bits be denoted by r, and 
let the number of channel symbols for which an error occurs for the 
uncoded, but not for the coded, be denoted by s. Since the uncoded 
transmissions are equally likely to be +1, then along any L,D,r,s path, 
the coded and uncoded symbols may add or subtract, depending on the 
particular correct path. In 14” of the paths, the correct symbols of the 
coded and uncoded transmissions will algebraically subtract over all r 
symbols. Similarly, in (j)/2” of the paths, there will be a subtraction in 
(r — J) symbols and an addition in j ee In s symbols, 67 = 1 and 
yp, = 0, while in (D — r) symbols, 5, = 0 and #7 = 1. There arer + s errors 
committed in the uncoded transmission. Thus, averaging over all possible 
correct paths of the same L,D,r,s, we obtain the result that the average 
probability of error for each path of the same L and D for the uncoded 
transmission is given by 


1Z-D D /L—-Dy /Dvr+s 2 ‘ 
r= EET) YEEEE (araip, a2 
where 
Os ee 
[V5 ltr — VB — VED? + VE + VE>)2 + (D —r)Eo+ sky, : 
0 


(13) 


The factor of 14 appearing in front of (12) arises from the fact that two 
uncoded bits are transmitted per epoch. 
Using the inequality that for x = 0, y = 0, 


QvVx + y} S QV xje?, (14) 
we can overbound and simplify (12) and (138) to the following: 
P, <Q [Vv a e-Fi/No[DXeEUNo + L(1 + X) —D] 
0 


X [1 + X)P-11 + e~E/NojL-D-1), (15) 
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where 
~ 1+ e~4VEiE2/No 
Da ree a 
2 
Finally, for a particular convolutional code, we use the code generating 
function matrix method of Viterbi® to identify all L,D paths for that code, 
and sum the contribution (15) for each such path over all possible paths. 
To this result must be added the contribution of the trivial case for which 
no coded errors occur (see Fig. 5, case 1). The contribution of these paths 


is simply —_— ease 
2E AE 
pP,=@lr/ a [Vv wt 1 

b= OV | TeV a, (17) 
The results of this exercise have been applied to the optimum K = 7, 
r = 14 code,® and appear in Fig. 6. Plotted there is the uncoded bit error 
rate bound vs the required energy per information bit-to-noise ratio, 
e,/N>, for various ratios of interference to signal (E2/E,). Also plotted 
is the ideal, interference-free performance. We see here that as E/E, 
decreases below 2.5 dB, performance starts to improve. The utility of 
the maximum likelihood sequence estimation (MLSE) to detect uncoded 
transmission in the presence of coded area coverage interference is il- 


e7~(E1-2V E1E2)/No. (16) 
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-8dB 
—4dB -3dB —2.5dB }E/E, 


—5dB 


BIT ERROR RATE 
7 
Loa] 





e,/Nq IN DECIBELS 


Fig. 6—Bound on uncoded bit error rate performance of maximum-likelihood detector 
vs e,/No for select values of interference. ° 
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lustrated by the following example. Suppose E./E, = —3 dB. Then, if 
simple bit-by-bit detection is performed, an asymptotic degradation of 
about 10 dB from ideal would be expected. However, through use of the 
MLSE, the asymptotic degradation is about 1 dB. 

We also see from these curves that, as E./E, decreases below about 
—8 dB, there is an apparent degradation in performance. This virtual 
result is caused by the bounding technique used, and is not experienced 
in practice. To see how this arises, we note that, as Eo/No becomes small, 
all paths through the decoding trellis exhibiting a fixed number N of 
uncoded bit errors become equally likely. The contribution of each such 
path to the bit error rate bound is, however, summed, indicating a much 
higher bit error rate than would actually be encountered since only one 
such incorrect path could actually be selected at any node. For suffi- 
ciently small E»/N,, in fact, the bound no longer converges. To evaluate 
performance of the MLSE in the regime where the bound converges 
poorly, extensive simulation studies were performed and are shown in 
Fig. 7. These studies show that there is in fact a degradation in perfor- 
mance as E>/E, decreases below —4 dB, but that the worst-case degra- 
dation of about 1 dB from ideal occurs for E2/E; = —10 dB. As the in- 
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Fig. 7—Uncoded bit error rate performance of maximum-likelihood detector vs e,/No 
obtained via simulation for select values of interference. 
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terference becomes smaller, performance begins to approach the ideal, 
interference-free case as intuitively expected. 

We now study the performance of the encoded area beam. Again, let 
the unmerged span be L channel digits long and let the coded digits be 
different along the correct and incorrect paths in D channel symbols and 
N information symbols. Then, averaging overall possible combinations 
of the uncoded symbols, the average number of area beam bit errors 
incurred along any L,D,N path is given by: 


Pane ¥ Ghee 4 (1) Qr.siD), 08) 


=0 r=0 s r 2" j=0 


where Q(r,s,j,D) is given by (13). Invoking inequality (14), we obtain the 
bound: 


P, < Ne-PE2/No(1 + e~e1/No)L-D(1 + X)P, (19) 


where X is given by (16). Once again, we use the generating function 
matrix approach to determine the contribution of each incorrect 
path. 

Results for the optimum K = 7 code appear in Fig. 8. Shown there is 
the bit error rate performance of the encoded area beam message vs 
e,/N, for select values of F,/Eo, the interference-to-signal ratio. We see 
that, when E; becomes much greater than E», performance approaches 
the ideal, interference-free case since, under these conditions, the MLSE 
algorithm exploits the large difference between the signal and interfer- 
ence strengths to correctly decode the small signal. For Ey > Ey, the 
bounding technique again suffers from poor convergence properties, and 
the results are meaningless. Again, extensive simulation studies were 
performed and are shown in Fig. 9. We see that, as expected, the ideal 
interference-free case is approached as E/E, becomes small. 

For all values of E,/E»o, the MLSE algorithm provides the best at- 
tainable performance. However, when E,/E> becomes sufficiently small, 
the improvement possible via MLSE becomes negligible as shown by the 
plots of Fig. 10. These data were obtained experimentally and show the 
bit error rate performance of the ordinary Viterbi algorithm in the 
presence of a single bit-synchronous cochannel interferer. The ordinary 
Viterbi algorithm operates as though no interference was present and, 
unlike the MLSE algorithm, would be useless for E; > Eo. However, for 
E, « Es, performance of the two are about the same, and the slight 
improvement possible via MLSE is not warranted in view of the addi- 
tional complexity incurred. 
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Fig. 8—Bound on coded bit error rate performance of maximum-likelihood detector 
vs e,/No for select values of interference. 


IV. APPLICATION 


We now apply the results of the preceding section to the problem of 
reducing-mutual interference between spot and area coverage beams 
sharing a common spectral band. Let the spot-beam radiation pattern 
be Gaussian-shaped and usable to its ~3 dB contour. In the absence of 
cochannel interference, the e.i.r.p. of the coded global beam would be 
8 dB lower than that of the spot beam at its —3 dB contour for the same 
system outage and bit error rate (BER) performance. This 8-dB factor 
can be broken down into a 3-dB component, since the information rate 
of the global beam is half that of the spot beam, plus a 5-dB component 
representing the coding gain of a K = 7, r = 1% convolutional code. 
Suppose we set E2/E, at the 3-dB contour of the spot beam at —8 dB. 
Then, throughout the spot-beam coverage area, —11 dB < E/E; S 
—8 dB. From Fig. 6, we see that, over this range, the BER performance 
of the uncoded spot-beam message is degraded by at most 1 dB if MLSE 
is employed. By contrast, if bit-by-bit detection of the spot-beam mes- 
sage were employed, the degradation would be between 2.9 dB and 4.4 
dB. 

Let the e.i.r.p. of both the spot and area coverage beams rise by 1 dB. 
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Fig. 9—Coded bit error rate performance of maximum likelihood detector vs ep/No 
obtained via simulation for select values of interference. 


Then, throughout the spot-beam coverage region, the BER performance 
is at least as good as that obtained in the absence of interference with 
1 dB less power. From Fig. 8, we see that, beyond the —3 dB contour of 
the spot beam, we can communicate via the area coverage beam in con- 
junction with MLSE with at most 1-dB degradation from the ideal in- 
terference-free situation provided E,/E, > 5.5 dB. Finally, from Fig. 10, 
we see that we can use the area beam with the ordinary Viterbi algorithm 
provided E,/E, < —12 dB. From these observations, we can construct 
the plot of Fig. 11, which shows the one-dimensional radiation patterns 
of a spot beam and the area beam and the usable regions for the spot and 
area coverage beams in the vicinity surrounding a spot beam. Implicit 
in this illustration is the fact that the e.i.r.p. of both the spot and area 
beams is increased by 1 dB to provide the same grade of service as pos- 
sible with 1 dB less power in the absence of cochannel interference. We 
see that communication via the spot beam, in conjunction with MLSE, 
is employed out to the —3 dB contour of the spot beam. From 0 = 634, 
out to 0 = 1.463gR, we can communicate via the area beam, even though 
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Fig. 10—Measured bit error rate performance of a K = 7, r = 2 convolutional code vs 
e,/N, for select values of interference. The interference is bit-synchronous with the en- 
coded channel bits, and the ordinary Viterbi algorithm with soft (3-bit) quantization is 
employed. 


the interference is stronger than the desired signal. Between 6 = 1.4034p 
and 6 = 2.7503qp, the performance degradation of the area beam exceeds 
the allotted 1 dB, and the desired grade of service cannot be provided. 
This region, then, is blacked out. Finally, for 0 > 2.75634p, communication 
via the global beam is again possible. . 

Thus, through utilization of an area coverage beam in conjunction with 
channel coding and MLSE, the blackout region of a multiple spot-beam 
communication satellite is reduced from the entire region not serviced 
by any spot beam to a thin annular ring surrounding each spot beam. 
There is no sacrifice in the capacity of the spot beams, and the power 
penalty is 1 dB for all beams. 

Let us now consider a specific example. We assume the existence of 
10 spot beams, half of which employ one polarization and half the or- 
thogonal polarization. In the absence of interference, each spot beam 
transponder uses a 3-watt final power amplifier, and the difference be- 
tween the spot and area beam antenna gains is 20 dB. Suppose we deploy 
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Fig. 11—One-dimensional plot showing the usable regions attainable via MSLE for a 
hybrid spot-area beam satellite system employing a K = 7,r = 1/2 code for the area beam 
transmission. The spot beam antenna pattern is Gaussian shaped. 


a single-area beam transponder employing one of the two polarizations; 
the capacity of this beam is one-half that of a spot beam, and a K = 7, 
r = '4 code is employed. In the absence of interference, the RF power 
required of the area beam would be 20 — 8 = 12 dB higher than any spot 
beam. The total required RF power for the hybrid system outlined above 
is then 


P = 1.25 X [8 X 10 + 47.5] = 97 watts. (20) 


By contrast, if we employ the band-splitting technique described in 
Section II, we would need 6 dB more power for each spot beam, and the 
power required for the area beam would be 17 dB higher than that re- 
quired for the spot beam in the absence of interference, since coding is 
not employed. The total power, then, would be 


P=4X3X10+ 50.1 X 3 = 270 watts. (21) 
Considering a 30-percent efficiency for the final TWT, the total dc power 
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required via coding is 323 watts, while that needed for the alternative 
band-splitting approach is 900 watts. 

Since, through use of coding, the dc power required for an area beam 
is only 158 watts, we might consider deploying a second area beam using 
the orthogonal polarization. Then, not only do we double the capacity 
into the outlying region, but we also eliminate the blackout region, since 
each spot beam is used in only one polarization. Area coverage commu- 
nication to the blackout region of one polarization can thereby be pro- 
vided in the second polarization. The dc power required for this approach 
is 442 watts. 


V. MAXIMUM-LIKELIHOOD ALGORITHM WITH FIXED PHASE SHIFT 


In Section III, we derived a maximum-likelihood algorithm which 
allows joint area and spot-beam coverage sharing a common spectral 
band whenever there is no carrier phase shift difference between the area 
and spot beam transmissions. We now derive the proper algorithm for 
use when there is a fixed phase shift difference, 6. During the kth clock 
cycle, the spot beam source emits two bits, b;, and bo», and the area 
beam source emits a single bit a, and two encoded channel bits y1z (a) 
and yo,(a). The data b;;, and bj, are modulated onto a carrier via 
4-PSK, as are y;,, and y2x. Thus, we transmit: 

R(t) = VE\biE cos(wt + 6) + VE jbo, sin(wt + 6) 
oh VE oy 1k cos wt + VEoyor sin wt. (22) 

The receiver locks onto the phase of the encoded area beam and, 
during the kth clock cycle, the receiver observes, after coherent demo- 
dulation, the two test statistics: 


Tur = V Boyirt VE bi, cos8+ VE; bo, sin 6 + ny p (23) 
ror=V E2y2r —-VvV E161 sin O+v E bop cos 6 + NOR. (24) 


The path metric now takes the form: 
A(a,b) = 3 [rig(V Eoyipn + VE }b1, cos 0+ VE, bo, sin 8) 
k 


+ ron(WEoyor — VE1b1, sin 6+ VE jbo» cos 6)] 
— VE; Eolyi,n (01, cos 6 + bo, sin 8) 
— yor(bi, sin 8 — bo, cos A). (25) 
As before, we define the state of the encoder by the contents of the first 
K — 1 stages of its shift register, and each state can be accessed via eight 
paths. Along each path, we compute the partial metric: 
Ap (a,b) = Ap—s(a,b) + V Bor eyip + V Bor e,k¥2,k 
+ VEV(rip — VE2 ¥1,n)(b1,2 cos 8 + bo, sin @) 
— VE\(r2on — V Es ¥2,n)(b1,n sin 6 — box cos 0), (26) 
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and save the path and metric of the larger. Thus, with a fixed known 
phase shift, maximum-likelihood decoding is also possible. 


Vi. CONCLUSIONS 


In multiple spot-beam communication satellite systems, it is often 
desirable to provide service to remote areas not covered by any spot 
beam. This additional service should neither diminish the capacity of 
the spot beams nor cause a severe downlink power penalty. We consid- 
ered deployment of an area beam transponder, in addition to the fixed 
spot beams, and saw that satisfaction of the above requirements implies 
considerable downlink cochannel interference at all ground stations 
located in the vicinity of any spot beam. The use of binary convolutional 
codes for the area beam transmission was shown to greatly curtail the 
performance degradation resulting from this cochannel interference and 
also reduce the prime power requirements of the area beam transpon- 
der. 

A maximum-likelihood algorithm was derived to optimally detect 
either the uncoded spot beam transmission or the coded area beam 
transmission, and performance of this algorithm was evaluated. Use of 
this algorithm was shown to provide for reliable spot-beam communi- 
cation in the presence of cochannel interference. It is also possible to 
reliably communicate via the global beam in the presence of a much 
stronger spot-beam interference. These results were then applied to a 
scenario in which interference was reduced on the uplink via the simple 
technique of band-splitting between the area and spot beams. Such a 
technique is unsuitable for the downlink because of the power penalty 
incurred. On board, the uplink bits are regenerated and switched into 
the appropriate downlink beam, and a K = 7, r = % code is employed 
for the downlink area beam. Results show that the degradation from 
cochannel interference is contained to be less than 1 dB over the entire 
service area except for a thin annular ring surrounding each spot beam. 
Traffic originating within or destined for these blackout rings might be 
backhauled to the nearest serviceable region, or else a second area beam, 
employing the dual polarization, might be deployed such that, for any 
given spot beam, the blackout region is contained to only one polariza- 
tion. Since the spot beams use both polarizations to minimize interfer- 
ence among themselves, the MLSE algorithm must still be used at all 
spot-beam ground stations to provide spot-beam service with minimal 
performance degradation. 

The satellite prime power demands to satisfy RF radiated power re- 
quirements were evaluated and shown to be within the capability of the 
Thor-Delta class. Thus, the use of spot and area coverage beams, sharing 
a common spectral band, in conjunction with channel coding techniques, 
appears to be an acceptable method for providing universal service via 
high-capacity digital switching satellites of the future. 
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Protection Switching System 
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High-capacity transmission systems usually include one or more hot 
spares for protection. When a regular transmission channel fails, its 
signal is rapidly transferred to the spare channel under the control of 
protection switching circuits so that there ts little signal degradation 
or interruption. This paper studies the reliability of a microproces- 
sor-based terminal protection switching system. Some new and inter- 
esting behavior patterns for transmission systems with automatic 
protection switching are revealed. Also, some new memory self-checking 
algorithms are presented which increase the capability of micropro- 
cessor system fault recognition. 


Il. INTRODUCTION 


In high-capacity transmission systems, any failure may affect a large 
number of message circuits. Such systems usually include one or more 
hot spares to increase system reliability. When a regular transmission 
channel fails, its signal is rapidly transferred to the spare channel under 
the control of protection switching circuits so that there is little signal 
degradation or interruption. This paper studies the reliability of a mi- 
croprocessor-based terminal protection switching system (TPSS). The 
specific transmission facility under consideration is the L5E coaxial cable 
analog system, which is an expanded version of the L5 system.! The L5E 
multiplex equipment, or multimastergroup translators (MMGT), carry 
up to eight mastergroups, or 4800 telephone circuits. The TPSS will au- 
tomatically switch into service a protection MMGT in the event of a 
failure of any one of up to 20 MMGTs. 

Reliability theory has been studied by numerous authors,” and al- 
most every Bell System transmission facility with automatic protection 
switching has been the subject of at least one reliability study.4° The 
present analysis was undertaken for several reasons. First, many sim- 
plifying assumptions were made in the previous studies. Not all the 
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effects of the reliability of the switch, the protection switching control 
circuit, and the monitor circuit failures were taken into account. Second, 
in most cases, exponentially distributed restoration time has been as- 
sumed. This means that the probability of restoration at any instant after 
a failure is assumed to be independent of how much time has already 
been spent on restoring the failure. This assumption is rarely true in 
high-capacity transmission systems. Third, only steady-state analyses 
were made. A system with hidden failures will not reach its steady state 
in its lifetime. Fourth, a microprocessor-based protection switching 
control circuit has not been studied in such detail before. Finally, past 
experiences have shown that maintenance-induced service outages 
contribute to a very big share of the total outage time. This study also 
tries to take these outages into consideration. 

With the MMGT system as an example, the present study attempts 
to analyze the same reliability problem in more detail and with less re- 
strictive assumptions. Section II describes the protection switching ar- 
rangement. Section III explains the specific approaches used in this 
paper. Section IV presents the results graphically to emphasize the 
various reliability trends. Section V summarizes the conclusions ob- 
tained. Appendix A investigates some new microprocessor self-checking 
algorithms and Appendix B presents the derivations. 


ll. MMGT PROTECTION SWITCHING SYSTEM DESCRIPTION 


Figure 1 is a simplified MMGT-system block diagram which illustrates 
the 1 X n protection switching arrangement. There is one protection 
channel in each direction of transmission. Under the command of the 
microprocessor, each protection channel protects up to n regular chan- 
nels, where n is equal to 20 in the TPSS. The same processor is used to 
control the switching actions of both directions of transmission. The 
switches are all solid-state devices, and their normal states are indicated 
in the figure. The crucial output switches are dual-powered. Parts of the 
output switch are designated the through switch and the substitute 
switch for later reference. 

When there is no alarm from the various regular pilot detectors, the 
processor exercises the input switches for each channel sequentially to 
detect possible protection failures. In the event of a failure of one of the 
regular channels, the corresponding pilot detector sends an alarm to the 
processor. If the protection channel is available, the processor will first 
switch the input signal through the input switches to feed the protection 
channel. Whether the protection detector indicates a good signal or not, 
the processor will complete the 1 X 2 output switch. The regular detector 
is now monitoring the signal supplied by the protection channel via the 
output substitute switch. If the regular detector still alarms after the 
protection switch, the switching action will be reversed. The 1 X 2 output 
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Fig. 1—TPpss block diagram. 


switch will be deactivated and the input switch released. If the regular 
detector stops alarming after the output switching, a successful pro- 
tection switch has been made, and the protection detector is monitoring 
the failed regular channel. When the failed channel is repaired, the 
protection detector will see a good signal, and the switches will return 
to their normal states..The protection channel is then free to service 
another regular channel failure. 

Service outages can occur in many ways. In addition to multiple 
transmission failures, they can also be generated by the failures of the 
detectors, the switches, or the microprocessor system. The various failure 
modes are taken into account in later derivations. 


Ill, APPROACHES 


Two reliability measures of interest in transmission systems are used 
in this study. The first measure is the probability of service outage due 
to equipment failures. This probability translates directly to the system 
outage time per year and is the most commonly used figure of merit in 
determining transmission system reliability. The second measure is the 
probability of having maintenance activities going on. This measure will 
be abbreviated as the probability of activity. It is believed to be closely 
related to the probability of having maintenance-induced outages. This 
probability of activity is greater than the probability of having alarms 
because there are failures that cannot be detected locally. For instance, 
if the pilot detector for a failed regular channel is stuck to the state of 
no alarm, the failure can only be detected by downstream offices. Thus 
there may be maintenance activities in an office but no alarm. The 
probability of activity is less than the probability of having failures be- 
cause there are undetectable failures such as the breakdown of an output 
substitute switch. A reliable system should have a small probability of 
outage and a small probability of activity. 

Two additional criteria are used to measure the effectiveness of the 
overall protection plan. The improvement factor (IF) is defined as the 
ratio of the probability of outage without protection switching to that 
with protection switching. The activity factor (AF) is defined as the ratio 
of the probability of activity with protection switching to that without 
protection switching. These definitions agree with the common notion 
that an effective protection plan should provide more improvement and 
less activity. Thus, a better protection system has a bigger IF and a 
smaller AF. The activity factor is always greater than one. 

The probabilities discussed above are derived under the assumptions 
that the various failures are statistically independent and the failure 
rates are constant. These are very simple assumptions considering the 
complexity of the problem. The assumption of statistical independence 
is made to avoid estimating conditional failures, although there is 
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probably dependency between the through switch and the substitute 
switch. The constant failure rate implies exponentially distributed 
failures, i.e., any working item is as good as new. This is a reasonable 
assumption for solid-state devices after the initial “burn-in” period. 
Notice that no distributional assumption is made on the restoration time. 
Based only on the failure rates and the restoration times of the compo- 
nents of the system, the various probabilities are derived from the basic 
definitions of conditional probability. Not only does this approach re- 
quire little mathematical background, but the result is more general and 
more accurate than the usual method of Markoff chain or birth-and- 
death stochastic processes,*? which assume that both failure and res- 
toration times are exponentially distributed. 


IV. DETAILED RESULTS 


Table I introduces the notations and gives the estimated failure rates 
in FITS (number of failures per component per 109 hours), restoration 
times in hours, and the availabilities of the various components. The 
restoration time is the sum of the detection time and the equipment 
replacement time. The mean value of the replacement time ¢ is assumed 
to be 1 hour. Some failure rates are expressed in terms of other failure 
rates to show their relative dependence. This is necessary in later pa- 
rameter sensitivity studies. The failure of a substitute switch can only 
be detected when its use is called for. Thus, its detection time is the mean 
time between transmission failures of its corresponding channel, i.e., 
1/(A,; + Az + Ao). The same is true for the detection time of a regular 
detector, except that the assumed probability that a failed detector gives 
a no-alarm indication is 1/4. In both cases, the equipment replacement 
time is ignored since it is small compared with the detection time. 

The detection times of the hidden CPU (central processing unit) and 
EROM (erasable read-only memory) failures should also be similarly 
calculated. However, the failure of the regular channels to be exercised 
sequentially should provide local craftspeople with the indication that 
something is wrong. Therefore, the detection times are assumed to be 
24 hours. The availability? of an item is the probability that the item is 
working. It is a function of time with an initial value of one and with a 
steady-state value equal to the mean time to failure divided by the sum 
of the mean time to failure and the mean restoration time. If a compo- 
nent has a short failure detection time, the transient portion in its 
availability value vanishes quickly, and the steady-state theoretical 
availability approximates the actual availability very well. For example, 
the steady-state availability of the regular channel is p, = 1/1.000001. 
The reliability function of the regular channel is e~ 1°. It takes only 
1 hour for the reliability function to reach its steady-state availability 
value. 


MICROPROCESSOR PROTECTION SWITCHING SYSTEM 2637 


8161 YAGW3ALdSS “IWNYNOL IWOINHOSL WSLSAS T19d SHL = 8€92 


Regular channel 
Through switch 
Output switch 
Substitute switch 
Regular detector 
Protection detector 
Protection channel 
CPU 

EROM 


RAM 


Table | — Estimated failure rates 
Mean Restoration 
as Time (hr) 
Ar = 1000 Hp=t Dr 
At = 150 Beat Pt 
1 
do = 3 Nt Bo =t Po 
2 1 
\ =X >. = = 
eee i ae a CES Ps 
1 1 
= 9) a 
ia a A Ke ANG 
Ap = 300 Hp =t Pp 
Mp=Ar+4A5+100  pp=t Dy 
- = 500 we =24+t De 
de = 300 He = Uc Pe = 
Aq = 400 Ug =t Pa = 


aa ea ae 
L+Xgus (As Fas yr | 


Pa = 


Availability 


1 


1+ ApH 


1 


1+ Ate 


a 
1 + Aout 


1 us —Ostus})T] 
1 ue dd 

i ae 

1+ rqea (Ag tug )?T 

ee, See 

1+ ADHED 


1 
1 + ApUp 


ees 
1+ AcHe 


Cerernl 
1+ dete 


Gr) 
1+ ata 


[1 —e~ atu! Ty 


These arguments do not hold for failures requiring long detection 
times. For instance, the mean time to failure and the mean restoration 
time of a substitute switch are in the order of hundreds of years, while 
the life span of the equipment is expected to be only 40 years. To obtain 
an appropriate availability in such cases, one would observe that the 
restoration time of the substitute switch is exponentially distributed. 
This is due to the fact that the replacement time is ignored and the de- 
tection time depends on the transmission failures which are exponen- 
tially distributed. Thus the availability function can be derived explicitly 
as 


et es ar a estes )t. 

T+ Asus As + Ms , 

The availability p, given in Table I is the A,(t) averaged over the life 
span T' of the equipment. The availability of the detector pg is obtained 
similarly. The availability expressions of the EROM and the RAM reflect 
the use of 4 EROMs and 2 RAMs in the TPSS. 

To gain insight and to study the sensitivity of the derived probabilities 
to the estimated failure rates and restoration times, the various estimated 
parameters are varied one at a time to show the system reliability trends. 
The results are presented graphically in the figures. In each figure, the 
solid line corresponds to the ordinate at the left and the dotted line to 
that at the right. 

Figures 2 through 7 present the variations of the outage and the ac- 
tivity probabilities as functions of the regular channel, the detector, the 
switch, the CPU, the EROM, and the RAM failure rates, respectively. Most 
of the curves are almost linear because, for the small failure rates of in- 
terests, they are still in their linear regions. As far as the probability of 
outage is concerned, undetectable failures are the most damaging. The 
hidden detector and the substitute switch failures contribute to the 
bigger slopes in Figs. 3 and 4. Increasing the microprocessor system 
failures adds very little to the outage probability, as can be seen from 
Figs. 5 to 7. The probability that has the fastest increase is the switch 
failure rates because there are so many switches in the system. Figure 
8 indicates that service outage can increase substantially if the re- 
placement time for failed equipment is long. Figure 9 shows the effect 
of varying the detection time of the hidden microprocessor failure. 
Neither the outage nor the activity probability is sensitive to the de- 
tection time. Figure 10 shows the effect of varying the number of regular 
channels equipped. The discrete points in the figure are connected to 
show the almost linear trends. When the system is fully loaded, i.e., n 
= 20, there are about 2 minutes of service outage each year due to 
equipment failures and there is about half an hour of maintenance ac- 
tivities. It should be emphasized that the curves present the right trends 
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Fig. 2—Probabilities of outage and activity as functions of regular channel failure 
rate. 
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Fig. 3—Probabilities of outage and activity as functions of detector failure rate. 


rather than numerical accuracy. From Fig. 2, if the failure rate of the 
regular channel is increased by ten times, there will be 4 minutes of 
outage and 4 hours of activity each year. Figure 10 shows the two 
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Fig. 4—Probabilities of outage and activity as functions of through switch failure 
rate. 
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Fig. 5—Probabilities of outage and activity as functions of CPU failure rate. 


probabilities as functions of the number of regular channels. The discrete 
points are connected to indicate trends. For terminal circuits which 
usually have small failure rates, there is scarcely any need for a second 
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Fig. 6—Probabilities of outage and activity as functions of EROM failure rate. 
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Fig. 7—Probabilities of outage and activity as functions of RAM failure rate. 


protection channel even when the number of regular channels is 
large. 

A system without protection switching has only the regular channels 
and their corresponding detectors to indicate alarms. The switches and 
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Fig. 8—Probabilities of outage and activity as functions of equipment replacement 
time. 
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Fig. 9—Probabilities of outage and activity as functions of hidden microprocessor 
failure detection time. 


the microprocessor devices are not required. Thus there is definitely less 
activity in the maintenance offices. Figure 11 shows the trend that, for 
small regular channel failure rates, the IF can be less than unity, i.e., 
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Fig. 10—Probabilities of outage and activity as functions of number of regular 
channels. 


having protection switching actually causes more service outage. This 
is true when the failure rate of the regular channel is small compared with 
those of the protection switching circuits. Furthermore, protection 
switching generates many more activities at low regular channel failure 
rates. Figure 12 amplifies this fact by examining the 1 X 1 configuration. 
The IF is so small and the AF is so big that implementation of a1 X 1 
protection plan is questionable at low failure rates. Figure 13 gives the 
variations of the two factors with detector failure rates. Since detector 
failures have little effect on the outage probability of an unprotected 
system, the IF decreases with increasing detector failure. The interesting 
shape of the AF curve is due to the relatively rapid increase in the prob- 
ability of activity for an unprotected system when the detector failure 
rates are small. This behavior is unique to the variation of the detector 
failure rate because an unprotected system is equipped only with the 
transmission channels and the detectors. 

Figure 14 again indicates the important role played by the output 
switch. If its failure rate is high enough, the IF can reduce to less than 
unity. With a perfect switch, the outage of a protected system can be 
hundreds of times less than that of an unprotected system. The curves 
showing the two factors as functions of the CPU, the EROM, and the RAM 
failure rates are not given here. These curves can be simply deduced from 
Figs. 5 to 7 because the various probabilities of an unprotected system 
are independent of microprocessor failures. Similarly, the factors in- 
volving hidden microprocessor failure restoration time can be obtained 
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from Fig. 9. Figure 15 shows that both the IF and the AF are not very 
sensitive to how long it takes to replace failed equipment. Figure 16 varies 
the number of regular channels. It indicates that more than 10 regular 
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Fig. 11—Improvement and activity factors as functions of regular channel failure 
rates. 
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Fig. 12—Improvement and activity factors as functions of regular channel failure 
rates. 
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channels should be used to take advantage of the protection switching 
arrangement. 
Figure 17 exhibits an interesting behavior of general protection 
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Fig. 13—Improvement and activity factors as functions of detector failure rates. 
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Fig. 14—Improvement and activity factors as functions of through switch failure 
rates. 
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switching systems. As the failure rate of the regular channel increases, 
the IF increases from less than one to a maximum and then starts to 
decrease. When the failure rate becomes very large, the outage proba- 
bility is close to 1 with or without protection switching. Thus the IF ap- 
proaches 1 eventually. The maximum IF shown in the figure occurs at 
around 150,000 FiTs. Although it is unlikely for a terminal multiplexer 
to possess so high a failure rate, a line transmission system with many 
cascading repeaters may very well have a failure rate of this order. 
Therefore, whenever a line protection switching system is planned, the 
reliability should be studied to determine the length of the protection 
span so that the IF does not fall in its decreasing region. Of course, the 
outage probability should also be taken into account to meet any pre- 
scribed service objectives. 


V. CONCLUSIONS 


The reliability of the microprocessor-based TPSS has been studied 
in detail using conditional probability. Consideration of the four criteria; 
i.e., the probability of outage, the probability of activity, the improve- 
ment factor, and the activity factor, should provide an adequate de- 
scription of the effectiveness of the overall protection plan. Several 
conclusions can be drawn from the analysis. First, terminal circuits 
usually have low failure rates so that one protection channel is adequate 
for the protection of many regular channels without having excessive 
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Fig. 15—Improvement and activity factors as functions of equipment replacement 
time. 
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_Fig. 16—Improvement and activity factors as functions of number of regular 
channels. 


probability of service outage. Second, undetectable failures are usually 
the prime causes for increased outage probability and decreased im- 
provement factor. If preventive maintenance is ever to be carried out, 
the hidden failures should be the principal targets. Third, the micro- 
computer is reliable as a protection switching controller. Although mi- 
croprocessor system failures can cause false switching all by themselves, 
they contribute only a very small amount of the total outage if adequate 
self-checking is implemented. Reliability could be further improved by 
providing hardware interlock logic to guard against an insane micro- 
processor. For example, logic circuit can be provided in the TPSS to 
prevent the operation of an output switch whenever its input switch is 
inactive. Fourth, all the figures indicate that, around the various esti- 
mated failure rates of interest, the outage probabilities increase almost 
linearly with the failure rates. Thus there is no “preferred” range of 
failure rates that any equipment should be designed to. Only the sensi- 
tivities of the outage probabilities to the various estimates are different. 
Fifth, for any TPSS, the implementation of a 1 X 1 protection plan should 
be studied carefully. Even if there is improvement in the outage proba- 
bility due to equipment failure, the increased activity will generate more 
maintenance-induced outages, not to mention increased costs. 

The above comments do not apply in line protection switching sys- 
tems, which have much higher regular channel failure rates because of 
the cascaded repeaters. Finally, Fig. 17 suggests one more consideration 
in determining the length of a line protection switching span. The failure 
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Fig. 17—Regular channel failure rates as functions of improvement factor. 


rate of the line should preferably not fall into the decreasing region of 
its improvement factor. The last two points are obvious and interesting 
protection switching behavior patterns which seem not to have been 
explictly pointed out before. 


APPENDIX A 


This appendix discusses microprocessor self-test algorithms whose 
purpose is to generate alarms as early as possible to initiate maintenance 
actions. The test should be exhaustive but should not require too much 
additional program memory. An 8-bit microprocessor is used in the TPSS 
application. 

When the power is turned on, the microprocessor immediately per- 
forms a thorough RAM check. Static RAMs are used, so there is no pattern 
sensitivity problem. The checking algorithm is to write the least-sig- 
nificant 8-address bits of each RAM byte into that specific RAM location. 
After all RAM locations are loaded, the contents of each byte are com- 
pared with its least-significant 8-bit address. After a byte is checked, its 
contents are complemented and checked again. The complemented 
contents will remain in those bytes already checked. This algorithm is 


MICROPROCESSOR PROTECTION SWITCHING SYSTEM 2649 


able to detect any bit, any data pin, and any combination of address pins 
stuck to zero or one. It can also discover data and address lines shorted 
together. Thus most RAM failures can be detected. 

The ROMs are checked immediately following the RAM check. Two 
consecutive bytes in each ROM are reserved for self-test. One byte is used 
for parity check and the other for short-circuits in address and data lines. 
The microprocessor reads out every byte in the ROM and performs a 
cumulative odd parity check through an exclusive-OR operation on each 
bit. It will be seen first that, as far as independent ROM bit failures are 
concerned, it is adequate to use only one byte to check the parity of all 
ROMs no matter how many. ROMs are used in the system. Let @ be the 
number of ROM bytes (excluding the reserved checking byte) used in the 
system and ¢ be the probability of a ROM bit failure. The probability of 
having parity violations is 1 — (1 — p)8, where p is® 


[rat gg AOR YY 
2 2 


The probability of having bit errors is simply 1 — (1 — «)@+))*8, For 
< 1, both probabilities can be approximated by 8 X (€ + 1) X «. Thus 
the single byte parity check is adequate when @c « 1. It can be seen below 
that this condition is always valid in practice. Since the experimental 
failure rate of the 1K-byte EROM is 300 FITS, the failure rate of each bit 
cannot be more than 300/(8 X 1024) ~ 0.037 FIT. If a ROM failure can be 
discovered in 24 hours, then « < 10-9. The number @ is limited by the 
microprocessor addressing capability which is 64K. Therefore, Ze « 1. 
The reason that one parity byte is used in each ROM is to detect address 
and data lines stuck to one or zero. Since the ROM has a capacity equal 
to a power of 2, astuck output looks like an even number of ones or zeros 
and violates the odd parity. A stuck address will cause half the bytes to 
be read twice and again violate the odd parity. 

The contents of the bits of the other byte used for self-test are alter- 
nating ones and zeros. When this byte is read, short-circuits in data lines 
are detected. If this byte is located at an address whose 10 least-signifi- 
cant address bits are alternating ones and zeros, reading this byte will 
most likely detect short-circuits among these address lines. The prob- 
ability is very small that within the same ROM another byte which also 
contains alternating ones and zeros is read because of shorted address 
lines. To detect some of the short-circuits in the remaining six most 
significant address lines, complemented numbers are stored in these 
checking bytes according to their address parities. Each ROM can select 
one of two hexidecimal numbers, AA or 55, to store at one of two ad- 
dresses. For the first ROM with 0000 starting address, the two addresses 
are 0155 and 02AA. 

The two consecutive checking bytes must be preceded by a jump or 
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branch instruction to bypass them in normal program execution. It is 
obvious that, if a single parity checking byte is located at an address with 
alternating ones and zeros, it alone can detect all ROM failures mentioned. 
above except shorted data lines. It is sometimes possible to make use of 
the opcode and the operand of the jump or branch instruction to check 
the shorted data lines. If any failure occurs in the first ROM where the 
checking program is stored, the failure cannot always be detected. Du- 
plicating the first ROM may be a possible solution. 

After the two memory tests, a few instructions are exercised to test 
the CPU. Then the microprocessor starts executing the main program. 
Under normal circumstances, the program never comes back to the above 
RAM, ROM, and CPU tests. Different checks are performed in the main 
program. To avoid delaying the program execution, only distributed 
checks on the memory system are made. For example, in going through 
a program loop, only one RAM byte is tested and only one ROM exclusive 
OR is taken. However, the ROM check uses the same algorithm discussed 
above. The RAM check uses alternating ones and zeros which detect only 
shorted data lines and stuck bits because the exhaustive RAM check. 
discussed before will destroy the temporary data stored, in addition to 
requiring long execution time. After each cycle of the nonexhaustive RAM 
check, an additional test’ is made. Zeros are stored in the first RAM byte. 
Ones are stored only in RAM bytes with addresses 2', i = 1,2,---. Every 
time all ones are loaded into an address, the contents of the first all-zero 
byte are also checked. The check is also distributed so as not to delay 
normal program execution. Most remaining RAM failures’can be dis- 
covered by this additional test. 

The effectiveness of the two RAM checking algorithms discussed above 
is similar. The first one used when turning on the power requires fewer 
steps and is faster. The second one does not destroy any temporary data 
because every check involves at most two RAM bytes (the first byte and 
the 2'th byte) whose contents can be temporarily stored into CPU reg- 
isters. 

No CPU check is performed in the main program. A restarting sanity 
timer is employed to detect CPU failures. Under normal operation, the 
program retriggers the timer at durations shorter than the length of the 
timer. If the timer times out, an alarm is generated and the micropro- 
cessor system will go through its power on restart cycle again. The re- 
starting sanity timer detects complete CPU failures. It can sometimes 
catch other CPU failures (for example, program counter skipping). It also 
reduces the damages that are caused by power transients because it re- 
starts the system. RAM failures sometimes cause the timer to time out. 
ROM failures have similar effects but are more difficult to be self-de- 
tected. Output failures can only be detected by reading back the output 
bits immediately after each output operation. 
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APPENDIX B 


This appendix derives the probabilities of outage and activity with 
and without protection switching. Figure 1 shows the configuration for 
al Xn protection switching system in each direction of transmission. 
The microprocessor is responsible for the switching actions of 2n regular 
channels. The unprotected system has only the regular transmission 
channels plus pilot detectors for alarm. 

The events of interests in deriving the outage probabilities are 


S: service outage without protection switching. 
Sp: service outage with protection switching. 
Gy: all regular channels are good. 

G2: both protection channels are good. 
G3: all regular detectors are good. 

G,: all through switches are good. 

G;;: all substitute switches are good. 
Gg: the microprocessor system is good. 
G7: all output switches are good. 


The events G;’s are assumed to be statistically independent. Their 
probabilities are given by 


P{G,} = p;” 
P{G2} = p? 
P{G3} = pz" 
P{G4} = p?” 
P{G5} = p3” 


P{G6} = Pm = PcPePa 
P{G7} = pe", 


where the notations are defined in Table I. The symbol q with appro- 
priate subscripts is defined to be 1 — p with the same subscript. Let G; 
be the complement of G; and g be the joint events of the G;’s with sub- 
scripts denoting the complemented events. For instance, 


80 = G1GeG3G4G5G6G7 
and 
835 = G1G2G3G4G5Ge6G7. 


If these events represent all the possible failure modes of the system, 
then 


P{Sp} = P{Spgo} + P{Spgi} + --- + P{Spge} + P{Spg7} 
+ P{Spgio} +--+ + P{Spgosase7} + P{Spgiesase7}. (1) 
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There are a total of 27 terms in (1). Half the terms involve the event Gz, 
which generates service outage regardless of the other events. There- 
fore, 
P{Sp} = 1 — po" + P{Spgo} + --- + P{Spge} + P{Spgi9} 
+++++ PlSpgosas6} + P{Spgiosas6}. (2) 
The 26 unknown terms in (2) are to be evaluated. Since the derivations 
of each term are very similar, only the details in obtaining the more in- 
volved P{Spgi345} and P{Spgo.} will be given. From the definition of 
conditional probability, 
P{Sp/g1345} = P{Sp/g1345, three or more channel failures} 
_+ P{three or more channel failures/g 13,5} 
+ P{Sp/gi34s, two channel failures}P{two channel failures/g 1345} 
+ P{Sp/g1345, one channel failure}P{one channel failure/g 1345}. (8) 


It is obvious that two protection channels cannot protect three failures; 
hence 


P{S p/g1345, three or more channel failures} = 1. 


The joint event of three ‘or more regular channel failures and 
G1G2G3G4G5G6G7 has the conditional probability 


P{three or more channel ee 1345} 


pe 2np ge n(n =D pe a? 
= X — Da ae sa pi")(1 aa D2")PmPo" : (4) 
Pig1345} 


The second term in (3) will be evaluated next. The various events will 
be abbreviated by their initials after their full names are introduced; e.g., 
tcf represents two channel failures. 
P{Sp/g 1345, tcf} = P{Sp/g1345,tcf, both failures in the same 
direction of transmission} - P{both failures in the same 
direction of transmission/g1345,tcf} + P{Sp/g1345, tcf, one failure 
in each direction} - Fone failure in each direction/g1345,tcf} 
=1- {n(n — 1)p7""Yq?p3 (1 — p2")(1 — p2”)(1 — 1p”) pmpa"y/ 
Pg 1345, tcf} + P{Sp/gi345,tcf, one failure in each 
direction} - P{ofied/g1345,tcf}. (5) 
Equation (5) follows because one protection channel cannot protect two 
failures in the same direction of transmission. The second term of (5) 
gives 
P{Sp/g 1345, tef,ofied} = P{Sp/g1345,tcf,ofied, two 
associated detectors are not both good} - P{two associated 
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detectors are not both good/g1345,tcf,ofied} 
+ P{Sp/g1345,tcf,ofied, two associated detectors good} 
- P{two associated detectors good/g1345,tcf, ofied} 
= 1-[n2p??-?q?p3(1 — pa)(1 — p?")(1 — p3")pmpo")/ 
Pig 1345, tcf,ofied} + P{Sp/g1345,tcf,ofied,tadg} - P{tadg/g1345,tcf,ofied}. 
(6) 
P{Sp/g1345,tcf,ofied,tadg} = P{Sp/g1345,tcf,ofied,tadg, 
both associated substitute switches good} - P{both 
associated substitute switches good/g1345,tcf,ofied,tadg} 
+1-[n2p?"-°q?pppa(l — pa"~?)(1 — p?")(1 — p3)Ppmpo"|/ 
P{g1345,tcf,ofied,tadg}. (7) 
P{Sp/g1345, tcf,ofied,tadg,bassg} = P{Sp/g1345, tcf,ofied, 
tadg,bassg, both associated through switches good} 
- P{both associated through switches good/g1345,tcf,ofied, 
tadg,bassg} + P{Sp/g1345, tcf,ofied,tadg,bassg, not both 
through switches good} - P{not both through switches 
good/g1345,ttcf,ofied,tadg,bassg} 
= 1-[n?2p?"-*q?pspa(l — pa"~*)pe(1 — p?”-*)p3(1 — ps") pmpo"l/ 
P\g1345,tcf,oefied,tadg,bassg} + P{Sp/g1345, tcf,ofied,tadg, 
bassg, nbtsg} - Pinbtsg/g1345,tcf,ofied,tadg,bassg}. (8) 
For the first term in (8), it is known that not all through switches are good 
because of G4. The outage probability is one because if the two failed 
channels have good through switches, the rest of the through switches 
must have failure. Finally, 
P{Sp/g 1345, tef,ofied,tadg,bassg,nbtsg} = P{Sp/g 1345, tcf, 
ofied,tadg,bassg,nbtsg, no other through switch failure} 

- P{no other switch failure/g 1345,tcf,ofied,tadg, 
bassg,nbtsg} + P{Sp/g1i345, tcf,ofied,tadg,bassg,nbtsg, other 
through switch failure} - P{other through switch 

failure/g1345,tcf,ofied,tadg,bassg,nbtsg} 
= 0+ [n?pr"grpppa(l — pa*)(1 — pz)(1 — pi”~”) 
-ps(1 — p3"*)Pmpo"|/P\s 1345, tef,ofied,tadg,bassg,nbtsg}. (9) 
In (9), the first conditional outage probability is zero because all the 
failures are protected by the two protection channels. The above deri- 
vations illustrate one of the basic approaches. Each event and its com- 
plement are assumed until the conditional probability of outage is either 
one or zero. 
The third term in (8) is similarly derived. 


P{Sp/g1345,0cf} = P{Sp/g1345,0cf, associated detector bad} 
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- Pfassociated detector bad/gi345,0cf} + P{Sp/g1345,0cf, associated 
detector good} - P{associated detector good/g1345,0cf} 
= 1-[2np;""'q-ppqa(1 — p?")(1 — p3")Pmb6"\/Pi{g1345,0cf} 
+ P{Sp/g1345,0cf,adg}P{adg/g 1345,0cf}. (10) 
P{S p/g1345,0cf,adg} = P{Sp/g1345,0cf,adg, associated 
substitute switch good} - P{associated substitute switch 
good/g1345,0cf,adg} + 1 - [2np7?"~'q,ppPa(1 — pa"~*) 
X (1 — p?")qsPmP5"|/Pi{g1s45,0cf,adg}. (11) 
P{Sp/g1345,0cf,adg,assg} = P{Sp/g1345, ocf,adg,assg, 
one other through switch bad} 
- P{one other through switch bad/g1345,0cf,adg,assg} 
+1-{2np?"""q-pppa(1 — pa” ')[1 — p77? — (2n — 1)p??-7q4] 
-ps(1 — p3”~")pmpo"}/Plg1345,0cf,adg,assg}. (12) 
Equation (12) indicates that the status of the through switch associated 
with the failed regular channel has no effect on the outage proba- 
bility. 
P{Sp/g1345,0cf,adg,assg,ootsb} = P{Sp/g1345, ocf,adg, 
assg,ootsb, bad through switch in other direction of 
transmission} - P{bad through switch in other 
direction/g1345,0cf,adg,assg,ootsb} 
+1-[2np?""'g,ppPa(1 — pa"™")pi(n — 1)p? "ais 
X (1 — p3""!)pmpo"|/Plg1345,0cf,adg,assg,ootsb}. (13) 
P\Sp/g 1345, 0cf,adg,assg,ootsb,btsiod} = P{Sp/g 345, 
ocf,adg,assg,ootsb,btsiod, bad switch has good detector} 
- P{bad switch has good detector/g1345,0cf,adg,assg,ootsb, btsiod} 
+1-[2np;"""q,ppPaqdanpi” “Geps(1 — p3"~")pmpo"/ 
P{gi345,0cf,adg,assg,ootsb,btsiod}. (14) 
P{Sp/g1345,0cf,adg,assg,ootsb,btsiod, bshgd} 
= P{Sp/g1345,0cf,adg,assg,ootsb,btsiod,bshgd, corresponding 
substitute switch bad} 
- P{corresponding substitute 
switch bad/g1345,0cf,adg,assg,ootsb,btsiod,bshgd} 


+ 0- P{corresponding substitute switch good/g1345,0cf, 
adg,assg,ootsb,btsiod,bshgd} 
2n-1 


= 1-[2np?""'q,ppp4a(1 — pa”) np?" "q:PsdsPmPO'\/ 
P{g1345,0cf,adg,assg,ootsb,btsiod,bshgd}. (15) 


From (8) through (15), 
P{Spg 1345} = D2Dmps"'{(x + x3)(1 — pZ")(1 — p?")(1 — p2") 
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+ x1[qa(1 — p?")(1 — p2") + pal — p77"). — p?”)as 
+pal=pi" )*|Laph! = Gn = 1)pP alps —p?"’) 
+ pa(l — p31) -(n — 1)p7??- = (tp) 
+ paganp?” *qips(1 — p3""!) + p3(1 — p3"-?)np?”*arps4s] 
+ x4[(1 — p3)(1 — p?")(1 — p2") + pa(1 — p3"-?)(1 — p?")(1 — p?) 
+ pa(l — pa )pe(1 — pi?) ps(1 — pe “2 + rit — po 2 
a — p?)(1 — p?*) p21 — p2?- 2}, (16) 


where 


x1 = 2np2""}q, 


xo = 1 — p?" — 2np?"~1q, 
x3 = 1— p2” — 2np??-1q, — n(2n — 1)p2"-2q2 


=n 5 2q2 


essere 2 - 


To evaluate P{Spgo.}, the events 


Hy: CPU is good 
Hp»: ROMs are good 
Hs: RAMs are good 


will be considered separately. Let h represent joint events similar to those 
for g, for example, hy = H;H»2H3. As before, 
P{Sp/go¢} = P{Sp/gog, both protection channels bad}P{both 
protection channels bad/g 26} + P{Sp/gog, one protection 
channel bad}P{one protection channel bad/g2.6}P{S p/g26, bpcb} 
= P{Sp/g26,bpeb,hy}P{h1/g26, bpcb} + P{Sp/g26, bpcb,h} 
X Ptho/go¢,bpcb} + P{Sp/go¢,bpcb,h3}P{h3/g 26, bpcb} 
+ P{Sp/g 6, bpcb,h12}P{h12/g26, bpcb} + P{Sp/g26,bpcb,h13} 


X Pthi3/g26, bpcb} + P{Sp/go6, bpcb,ho3}P{ho3/g 26, bpcb} 
+ P{Sp/gog,bpcb,h193}P{h123/g26,bpcb}. (17) 


The microprocessor operation is so complicated that simplifying as- 
sumptions have to be made before (17) can be further evaluated. There 
are two kinds of CPU failures. The first kind is a partial failure which may 
not be detected by the self-checking method discussed in Appendix A. 
For instances, program counter skipping and one CPU transistor failure 
within the CPU may not always be detectable. This partial failure may 
generate false switching and result in service outage. The second kind 
is a complete failure, and the CPU operation stops altogether. No false 
switching will be made in this case, and the sanity timer will detect the 
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failure immediately. It is assumed that partial failures accounts for 20 
percent of the total CPU failures. 

When the CPU is partially failed, it executes the contents of the ROMs 
insanely. Every “instruction” has a finite probability of generating a false 
switching. The TPSS software contains approximately 4000 bytes of 
which 100 can be I/O instructions. Out of the 2n + 5 hardware addresses, 
2n have outputs controlling the switches. If a correct parity bit and an 
appropriate output switch control bit are stored in the accumulator, an 
1/0 instruction will operate the output switch. If the protection channels 
are bad, the operation of the output switch will generate service outage 
regardless of the status of the input switch. Thus the probability p; that 
any instruction will cause an outage is approximately 


100 1 2n 


P1™ 4000 4 In +5" 


When the protection channels are working, the same probability is 
now 


100 1 2n 


P2 4000 8 In+5 


because the input switch should be inactive for the false output switching 
to generate service outage. It is to be noted that false switching can also 
occur randomly if the 8-bit “instruction,” the 16-bit “address,” the parity 
bit, and the switch control bit happen to match the real instruction and 
address. This probability is of the order 2n/276 and is negligible compared 
with p; and pg. On the average, each instruction takes about 4 micro- 
seconds. Thus before restoration, about 


HM, X 60 X 60 X 106 
yy SS SS 
4 
“instructions” are executed. The probability p; that an outage will occur 
is 








P3=Pitqiwpit::-+qp'pi 
iat 
= Pi 
Li 
=1-q?. 


When the protection channels are good, the corresponding probability 
is 


pa=1— af" 
After a false switching, it is possible that insane CPU may deactivate 


the switch and restore service. It may also operate other output switches 
to generate additional service outages. These two conditional proba- 
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bilities are small. If they are ignored, the outage probability assuming 
partial CPU failure and bad protection channels is then p3t/p,. If only 
one of the two protection channels is bad, let 

100 1 rn, 100 100 1 on 


P5™ 7000 8 2n+5 4000 4 Int5” 


The outage probability is pgt/u, where 
Pe = 1-45". 

When a memory failure occurs, the program counter jumps to an ar- 
bitrary location. The initial effect is somewhat like that of a partially 
failed CPU. Experiments indicate that outage is unlikely to occur if it has 
not occurred during the initial period. Since 25 out of the 4000 bytes are 
used to activate the output switches in normal program operation, a jump 
to these bytes will cause a false switching. Therefore, the false switching 
probability is 


Pi= Blas + py 
4000 
or 
Ps= eas + Po 
4000 : 


depending on whether the protection channels are bad or good. If only 
one of the two protection channels is bad, the probability is 


25 25 

4000? 

It will be assumed that all RAM failures can be detected. Most of the RAM 
bytes are used for stack. The effects of the ROM and the RAM failures 
are assumed to be identical, but their restoration times are different 
because not all ROM failures are self-detectable. When the CPU fails, 
memory failures are assumed to have no effect on the system. This makes 
the evaluation of the fourth, the fifth, and the last terms in (17) unnec- 
essary once the first term is evaluated. It is further assumed that when 
there are both ROM and RAM failures, the trouble can be detected im- 
mediately. Given the previous assumption, then 


P{Sp/g9¢,bpcb,hi} = P{Sp/go6,bpcb,h1, complete 
failure}P{complete failure/gog,bpcb,h1} 
+ P{Sp/gog,bpcb,h1, partial failure}P{partial failure/g2g,bpcb,h1} 


P9= 


Pat P109p0-2GcPePa 
Uc P{go¢,bpcb,h4} ‘ 
where 


P10 = (PrPaPtPsPo)2”. (18) 
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Dit P1092PcVeDa 
P{Sp/f26, bpcb,h2} = = ee 
(Sp/g26, bpcb,ho} be P{gog, bpcb,ho} 


2 
PIS sb bh = _P109pPcPeGa_ 
{Sp/go6,bpcb,h3} = p7 Pleos,bpcb hel 


2 
P{Sp/g0¢,bpcb,htos} = py eset ePededa 
(Sp/g26,bpcb,h23} = p7 P{go¢, bpcb,h 23} 


Hence 








t t 
P{Sp,Z26, bpcb} = pio ie = 0.2 qe + Pr PcGePa + PrBeQa|- (19) 


c e 


The expression P{Sp,g9g,opcb} can be similarly evaluated. Finally, 


Pet Pot 
P{S pg26} = P10 {20% & 0.2g¢ + DedeDa + Poveda | 
Cc 


He 


Psat Dit 
Te q2 | ‘ O2ge7k PcGePa t prveda | |. (20) 
c 


e 

After deriving (16) and (20), the remaining terms in (2) are easy to 
obtain. They will not be given here. Thus the outage probability with 
protection switching P{S,} is obtained from (2). It should be emphasized 
that, because there are hidden failures, multiple equipment failures 
cannot be neglected in evaluating the various terms in (2). In fact, the 
term that contributes the most to the outage probability is P{Spg135}, 
which involves both of the undetectable failures (detector and substitute 
switch). 

Since the detectors used to generate alarms do not affect signal 
transmission, the outage probability without protection switching is 
simply 


P{S} = 1 — p2". (21) 
The improvement factor is 
P{S} 
= (22) 
PtSp} 


Next, the probabilities of activity with and without protection 
switching will be considered. The additional events of interest are 


A: activity without protection switching 
Ap: activity with protection switching 
Gs: protection detectors are good. 


Gs is redefined because protection detector failures generates mainte- 
nance activities, but the hidden substitute switch failures are assumed 
to cause no activity. To calculate the probability of activity with pro- 
tection switching, notice that whenever Gj, G4, and G7 occur, there will 
definitely be maintenance activity. Furthermore, the events G2 and G; 
are detectable when Gg is true. Therefore 
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P{Ap} = 1 — (p,p:po)?” + (PrptPo)?"Pm(1 — pppb) + P{Apgo} 
+ P{Apg3} + PiApge} + PiApgo6} + PiApgac} + PiApgse} 
+ P{Apgo36} + P{Apgos6} + PlApgase} + PlApgosse}. (28) 
In (23), P{Apgo} is always zero. The last seven terms are negligible com- 
pared with P{Apg3} and P{Apgg}. It is assumed that 10 percent of the CPU 
and the ROM failures will not generate alarm. The derivation of P{Apgg} 
is similar to that of (17). For example, 
P{Ap/gg¢h1} = P{Ap/g¢,h1, undetectable 
failure}P{undetectable failure/ggh1} 
+ P{Ap/g¢,h,, detectable failure}P{detectable failure/g¢,h4} 


t 
= 0+-—+ (prpaPePo)*"(PpPp)? : 0.9 + dePePalPigehi}. 


Uc 


Thus, 


. 
PlApeel = (PsPaPePo)*(Dppp)? | 09 a 


Cc 


t 
+ 0.9 — pcGePa + peda | (24) 
He 
If it is assumed that, when a detector fails, the probability that it is 
stuck to an ON state is 0.25, then 


P{Ap/g3} = P{Ap/g3, one detector bad}P{one 
detector bad/g3} + ---+ P{Ap/g3, 2n detectors 
bad}P{2n detectors bad}. (25) 


The ith term in (25) is 


P{Ap/g3,idb} = P{Ap/g3,idb, all bad detectors 
| on}P{all bad detectors on/g3,idb} 
+ P{Ap/gs3,idb, some bad detectors off} 
- P{fsome bad detectors off/g3,idb} 


2n — 

; (P-P:Po)2"(PpPD)*Pm ( 2 pz'qi(1 — 0.25%) 

_ 0 Sa SS ee 
Hd P{gz,idb} 


Therefore, 
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t 2n 3 . 
P{Apg3} = ge” (PpPp)*(P;PtPo)?” Xu pa" 'gi;(1 — 0.25). (26) 
Equations (23) through (26) yield the probability of activity with 


protection switching P{Ap}. The probability of activity without pro- 
tection switching P{A} is simply 


t 2n ‘3 
P{A} = 1- p*+—p?" ¥° ee p#*-igi(1 — 0.25%), 
U 





Hb i=1 
where 
eee! ee 
ne 1+ AgHo 
and 
_ 1 
Lp ad, 


is the detector restoration time without protection switching. The ac- 
tivity factor is given by 


PIA 
AF = Pidp| : (27) 
P{A} 
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Offset Multireflector Antennas with 
Perfect Pattern Symmetry and 
Polarization Discrimination 
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Conditions are derived that are useful for designing reflector an- 
tennas with excellent cross-polarization discrimination. These con- 
ditions ensure circular symmetry and absence of cross-polarization 
everywhere in the far field of an antenna, provided a suitable feed such 
as a corrugated horn is employed. The spherical wave radiated by the 
fundamental mode of such a feed has circular symmetry around the 
axis, and it is everywhere free of cross-polarization. An arbitrary se- 
quence of N confocal reflectors (hyperboloids, ellipsoids, paraboloids) 
is combined with such a feed. It is shown that it is always possible to 
ensure circular symmetry (and absence of cross-polarization) in the 
antenna far field by properly choosing the feed axis orientation. If the 
final reflector is a paraboloid, a simple geometrical procedure can be 
used. It is also shown that the asymmetry caused by an arbitrary 
number of reflections can always be eliminated by properly introducing 
an additional reflection. An application to the problem of producing 
a horizontal beam using a vertical feed is discussed. Two arrangements 
are described that may be useful for radio relay systems. 


Use of orthogonal polarizations is often required in radio systems to 
double transmission capacity. Antennas providing good discrimination 
between the two polarizations are then needed. The main purpose of this 
paper is to derive and discuss certain conditions that ensure excellent 
discrimination. When two or more reflectors and a suitable feed are ar- 
ranged in accordance with these conditions, the antenna far field has, 
in all directions, the same polarization of the feed excitation. Further- 
more, its pattern has circular symmetry. The above conditions also 
minimize astigmatism, and for this reason they are also useful* in the 
design of multibeam antennas (with several feeds). 


* This is the subject of an article being prepared. 
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1. INTRODUCTION 


A suitable feed for the antennas considered here is realized by properly 
corrugating the walls of a circular horn.!* The spherical wave radiated 
by the horn then has circular symmetry and, by placing the feed at the 
focus of a paraboloid, an antenna with circular symmetry in the far field 
is obtained, provided the paraboloid is centered around the feed axis. 
Furthermore, the polarization of the plane wave reflected by the para- 
boloid then coincides with that of the feed excitation. 

However, in the centered configuration the reflected wave is in part 
blocked by the horn.* To avoid this, the horn axis can be offset as in Fig. 
1, but unfortunately this causes asymmetry in the pattern after reflec- 
tion, resulting in an undesired cross-polarized component.» The same 
behavior occurs if, instead of a paraboloid, an arbitrary reflector system 
with a single axis of revolution is used. In Fig. 1, the asymmetry of the 
reflected wave increases with the angle of incidence a of the ray corre- 
sponding to the horn axis. This particular ray will be called principal 

ray. 

Although a single offset reflection always causes some asymmetry, it 





PRINCIPAL RAY 


__ PHASE CENTER F,, 
~~ OF HORN 








{ 
PARABOLOID AXIS 


HORN AXIS — ~ 


Fig. 1—The spherical wave radiated from Fo by a corrugated feed is transformed by 
an offset paraboloid into a plane wave. 


* This blockage impairs gain, side-lobes level, return loss, and cross-polarization dis- 
crimination. 
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is possible to combine two reflections with nonzero angles of incidence 
so as to ensure perfect symmetry after the two reflections.’-!° In this 
paper we generalize and extend the results of Refs. 7 to 9 in several re- 
spects. First, the analysis here is not restricted to only two reflections, 
nor does it assume the final reflector is necessarily a paraboloid. Second, 
very simple conditions that guarantee symmetry after the final reflection 
are obtained. These conditions are shown to be direct consequences of 
a general principle of equivalence (see the appendix). Third, a general 
solution is given to the problem* of restoring the symmetry of a wave 
whose initial symmetry has been distorted by an arbitrary number of 
reflectors. 

In Section III, two arrangements with excellent performance in 
cross-polarization are described. Both arrangements produce a horizontal 
beam using a vertical feed and may therefore be useful for microwave 
radio systems. 

The following analysis is based on geometrical optics. Furthermore, 
the far field for the antennas of Figs. 12 and 18 is not derived in Section 
III, but it is important to note that the principle of equivalence of the 
following section allows the aperture field distribution for both antennas 
to be derived replacing the reflectors with a single paraboloid, centered 
around the feed axis. The aperture field distribution and far field of such 
a paraboloid are well known.!> As pointed out at the beginning of this 
introduction, the entire aperture will be polarized in one direction if the 
feed is linearly polarized. The far field is thus free of cross-polarization, 
neglecting secondary effects such as edge diffraction. 


Ill. THE EQUIVALENT REFLECTOR AND THE ORIENTATION OF ITS AXIS 


Suppose a spherical wave from F, initially with symmetrical pattern, 
is successively reflected N times, using paraboloids, hyperboloids, and 
ellipsoids as shown in Fig. 2 for N = 3. The reflectors are properly ar- 
ranged so that a spherical wave is produced after each reflection. Thus, 
if F,, is the focal point after the nth reflection, the nth reflector 2, 
transforms a spherical wave centered at F,—; into a spherical wave 
centered at F,,. Note that some of the points Fo, Fj, ---, Fy may be at 
©, in which case the corresponding spherical waves become plane waves. 
In Fig. 2, F3 is at ©, and therefore the last reflector is a paraboloid. 

It is shown in the appendix that such a sequence of confocal reflectors 
is always equivalent to a single reflector which will be either an ellipsoid, 
a hyperboloid, or a paraboloid. This equivalent reflector produces, after 
a single reflection, the same reflected wave’ as the given sequence of 


* An interesting formulation of this problem is given in Ref. 10. 

t Thus, if one considers the field distribution over a wavefront reflected by the equivalent 
reflector, it will coincide with the field distribution over the corresponding wavefront 
produced by the given sequence of reflectors. 
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PRINCIPAL RAY ALL RAYS REFLECTED 
BY D3 MEET AT F3, 
WHICH IS AT © 





Fig. 2—The spherical wave from Fo is transformed into a plane wave by three confocal 
reflectors. The nth reflector transforms the spherical wave from F,,-, into a spherical wave 
converging towards F,,. 


reflectors. Thus, for the purpose of determining the properties of the 
reflected wave, one may replace the WN reflectors with the equivalent 
reflector. This reflector has an axis of symmetry, which passes through 
Fo, and will be called the equivalent axis. It is clear that in order that 
the symmetry of the incident beam be preserved, the principal ray must 
coincide with the equivalent axis.* 


2.1 The central rays, their closed path, and the equivalent axis 


Consider first N = 1. Suppose the reflector 2; and one of its foci, Fo, 
are given, but the exact location of the axis of 2; is not known and must 
be found. Then one may proceed as follows. Let a ray from Fy be reflected 
twice by 21, as shown in Fig. 3, and let § and §” be the initial and final’ 
directions of the ray. Then, from Fig. 3, 


§ = §” (1) 


only when the ray coincides with the axis. Thus, the axis can be found 
by searching for a ray that satisfies this condition. Note from Fig. 3 there 
are two such rays, with opposite directions. 

Next consider N > 1. Since a confocal sequence of reflectors 2j,---, 
yn is equivalent to a single reflector 2,, the above procedure is appli- 


* Since one can travel along the equivalent axis in two opposite directions, two opposite 
orientations can be chosen for the principal ray. 
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(a) (b) (Cc) 


Fig. 3—The axis of 2; is determined by varying § until § = §”. 


cable also to this case. Thus, to determine the axis of 2, (equivalent axis), 
one must consider a ray from F, with initial directions §. This ray must 
be reflected twice by Z,, and § must then be chosen so that §” = §. Notice 
that the two reflections by 2, imply a total of 2N reflections in the 
original configuration. The first. N reflections take place in the order 2}, 
+++, Dn, while the last N have the reverse order Dy, +--, 21. The final 
ray passes again through Fo, with the same direction as the original ray. 
In Fig. 4a, § + §”. In Fig. 4b, on the other hand, condition (1) is satisfied, 
and therefore the ray through Fp gives the correct orientation of the 
equivalent axis (and the principal ray for which symmetry is pre- 
served). 

Notice that if, after the above 2N reflections, the ray in Fig. 4a is re- 
flected 2N more times it will not follow the same path of the first 2N 
reflections. In Fig. 4b, on the other hand, the path of the first 2N re- 
flections is closed. This closed path, which determines the equivalent 
axis, will be called the central path. The two rays that proceed along the 
central path in opposite senses will be called the central rays. 

We show next that condition (1) leads to a straightforward geometrical 
procedure for determining the equivalent axis when Dy is a parabo- 
loid. 


2.2 The equivalent axis when the last reflector Xj is a concave paraboloid” 

It is now shown that, when the last ellipsoid in Fig. 4a is replaced by 
a concave paraboloid, the final ray direction §” becomes independent 
of the initial direction s’. This constant value of §” then gives the di- 
rection of the equivalent axis, which can thus be found straightfor- 
wardly. . 

Notice the path of Fig. 4a involves two successive reflections by the 
last ellipsoid Dy. Let y be the angle between the axis of Dy and the ray 
produced after the second reflection (see Fig. 5). The parameters of the 
ellipsoid Zy are now gradually modified, keeping the vertex V and the 
focus Fy, fixed, increasing the distance between Fy and Fy-, until 


* The following considerations apply also when Dy is a convex paraboloid, but this case 
is of little practical interest and will therefore be ignored. 
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(a) 


1 _- EQUIVALENT AXIS 


ZN 
| z 


(N = 3) 


(b) 


Fig. 4—(a) 2N successive reflections. (b) The central path. The equivalent axis through 
Fo is obtained by varying in (a) the initial direction § until § = §” as shown in (b). 


Fn — ©. The ellipsoid then becomes a paraboloid with focus Fy—, and 
from the figure y = 0, which shows that 


If aray from the focus F'y_; of a paraboloid is reflected twice 
by the paraboloid, so that the second reflection occurs at 
©, the final ray coincides with the paraboloid axis and it has 
the direction going from F'y_,; towards the vertex V of the 
paraboloid. (2) 


This implies that, when in Fig. 4 the last ellipsoid Dy is replaced by 
a paraboloid, the direction of §” becomes independent of §, and it can 
be determined by tracing the ray Fy—;V as shown in Fig. 6. The direction 
§” so obtained gives the equivalent axis, as one may verify considering 
a ray with initial direction given by the above value of §”. One can see 


from Fig. 6 the path of this ray closes, after 2N reflections. Thus, 


To obtain the equivalent axis of a sequence of N — 1 re- 
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Ni 


Fig. 5—As the distance of Fy from the other focus F'y- is increased, keeping V and 
Fn-, fixed, the ellipsoid approaches a paraboloid with vertex V and focus Fy-1; for the 
ray reflected at I’ one has y — 0. 





Zn 
Sa 
Vv = 
Q as 
Sk 7 RAY REFLECTED BY 
/ THE PARABOLOID 
{ AT ~ 
PARABOLOID AXIS 
\ 
N=3 Age AXIS 

eee 


‘ 


Fig. 6—By tracing from ~ the path of the ray defined by the paraboloid axis one obtains 
after N — 1 reflections the equivalent axis through Fo. If a symmetrical feed is placed at 
Fo, centered around the equivalent axis, a symmetrical pattern will be reflected by the 
paraboloid. 


flectors 21, 22,+++, Lv—1 followed by a paraboloid Zy with 
focus Fy; and vertex V, simply reflect N — 1 times the ray 
Fy-1V by 2yn-1, 2N-9,°*+, D1. The final ray through Fis 
the equivalent axis and, therefore, the principal ray along 
which symmetry is preserved. (3) 


As an example, consider N = 2, and assume the first reflector is not 
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a paraboloid.* Then four different arrangements are obtained depending 
on whether the first reflector is an ellipsoid or an hyperboloid, and is 
convex or concave. In each case (see Figs. 7 and 8), the equivalent axist 
is determined by the intersection I’ of the paraboloid axis with the first 
reflector. The equivalent axis is the line Fol’. Note the axis of the para- 
boloid intercepts the first reflector 2, in two points, but only one, J’, is 
acceptable.! The acceptable point is the point of reflection of the ray FV. 
Since only one side of the surface 2D, is reflecting, only one of the above 
two points can be considered a point of reflection for the above ray. 

From Figs. 7 and 8, since in all cases the equivalent axis and the par- 
aboloid axis meet on 21, the angles 2a and 28 giving their inclinations 
from the axis of 2, are related, 


tan a = m tan B, (4) 


where m is the axial magnification of 2; given by the distances of the 
reflector vertex Vo from the two focal points Fp and Fj, 


mn = LFoVol (5) 


[FiVol 
Note that if e is the eccentricity of the reflector, in Figs. 7 and 8, 


pac e-l e+11l1-e 
e—-l’et+1’1—-e’1lte’ 





(6) 


respectively. In Fig. 7 one has e > 1, whereas in Fig. 8,0 <e <1. 

In the two cases of Figs. 7a and 8a, eq. (4) is equivalent to eq. (1) of Ref. 
9. In the other two cases, on the other hand, eq. (1) of Ref. 9 is not ap- 
plicable [to obtain a correct relation, one has to replace a with 8 in eq. 
(1)]. 


Another useful relation, derived in the following section, is 





tani = tan p. (7) 


1-M 


It relates the angles of incidence i and p of the central ray on the two 


* The case where 2 is a paraboloid is treated in Section 2.6. 

t That is, the beam orientation for which symmetry is preserved. 

t Notice for the purpose of deriving the equivalent axis that the entire surfaces of the 
various ellipsoids, hyperboloids, and paraboloids must be considered to be reflecting. Thus, 
both branches of an hyperboloid must be considered. Of course, an actual antenna will 
use only certain sections of the various surfaces. 
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(a) 
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(b) 


Fig. 7—How to determine the central path and the equivalent axis of a paraboloid 
combined in (a) with a convex hyperboloid and in (b) with a concave hyperboloid. 


reflectors (see Figs. 7a and 8) to the magnification M, defined as 


Ma aol eo 
[IF 1| 
I being the point of incidence of the central ray on the first reflector. In 
eq. (8) one has to take the positive sign when Fp and F are on opposite 
sides of the tangent plane at/, as in Fig. 8; otherwise, as in Fig. 8, M < 
0. The angles of incidence must be taken with opposite sign in Figs. 7a 
and 8, where the two reflections have opposite senses; in Fig. 7b, on the 
other hand, i and p have the same sign. 
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Fig. 8—How to determine the central path and the equivalent axis of a paraboloid 
combined in (a) with a concave ellipsoid and in (b), with a convex ellipsoid. 


The magnification* M determines the ratio between the angular width 
Qo of the beam incident as I and the width Q, of the reflected beam. More 
precisely, t for small Qo, 


=—, (9) 


If M is specified, eq. (7) gives the angles of incidence i and p that result 
in asymmetrical beam after two reflections. 

A very general relation, which reduces to eq. (7) in the particular case 
where 2vy is a paraboloid, is derived in Section 2.4. 


* Another important significance of M is that the paraxial focal length f., for any of the 
arrangements of Figs. 7 and 8, in the vicinity of the central ray, is fe = Mfp, where fp is the 
paraboloid focal length f, = CF; f, has the significance that a small lateral displacement 
és of a feed initially placed at Fo will cause an angular displacement 56 = 6s/f. of the beam 
reflected by the paraboloid. . 

t Thus, if a beam of small angular width po is transformed by a sequence of N reflectors 
with magnifications M,, ++-,My, the final beam has angular width 


Q) = MQ, 
where M; = MiMo2--- Mn. 
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2.3 Relations governing the reflections of a central ray by the first or the last 
reflector 


The restriction that 2, must be a paraboloid is now removed. The 
closed path of the central ray in Fig. 4 involves two successive reflections 
by 21. Consider these two reflections and assume for the moment 2; is 
a concave ellipsoid as shown in Fig. 9a. The central ray in Fig. 9a first 
passes through F’ with direction 4, it is successively reflected at I’ and 
I, and it then passes again through F with direction ¢. 

Let 2i and 2i’ be the angles of the two reflections and M and M’ the 
corresponding magnifications, 

M=- fi M=- al ; 
Lo Ly 
£1, 9, etc. being defined in Fig. 9a. Then, if 27 = 27 + 27’ is the total angle 
of reflection (given by the angle between the final and initial rays ¢ and 
G) it is shown in Section A.3 of the appendix that 


(10) 





tan i = tan (11) 


M-1 
and 





tan i’ = tan y. (12) 


| 
1—™M’ 

Thus, if the parameters (M,i, or M’,1’) of either reflection are given, the 
total angle of reflection for a central ray can be calculated. Note that eqs. 
(11) and (12) apply also to the two consecutive reflections of the central 
ray by the last reflector Dy. 

In Fig. 9a, the reflector 2 is a concave ellipsoid, but eqs. (11) and (12) 
are valid also if 2 is an hyperboloid or is concave, as shown in Figs. 9b, 
c, and d. Note in cases 9c and 9d the central ray is first reflected at I’, 

,then passes through the point at ~ and is then reflected again at J. Figs. 
7a,b and 8a,b correspond to Figs. 9b, 9c, 9a, and 9d, respectively. 


2.4 How to arrange two reflectors 


Consider Fig. 10a showing a principal ray from Fo reflected by two 
reflectors 2; and LY». We wish to show that, in order that this ray be a 
central ray, i.e., that symmetry be preserved after these two reflections, 
their parameters M, M’,i, and i’ must satisfy the condition 


— 7 


ay; tan 1’. (13) 





. J 

tani = M 
1 
Consider the ray reflected by 2. Let this ray be reflected twice by Do, 
and then again twice by 2j, as in Fig. 10b. If 27 denotes the total angle 
of the first two reflections by 22 and 27’ the angle of the other two re- 
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Fig. 9—Two successive reflections. (a) By concave ellipsoid. (b) By convex hyperboloid. 
(c) By concave hyperboloid. (d) By convex ellipsoid. 


flections, one must have 

2y + 2y’ = 2x, (14) 
if the path of the ray is to close (which is necessary for it to be a central 
ray) after the four consecutive reflections. Now tan y is given by eq. (11), 
and tan 7’ by eq. (12) with y replaced by 7’. Thus, by requiring condition 
(14), one obtains condition (13). In the particular case where the second 
reflector is a paraboloid, 


M’=0 
and eq. (13) give Eq. 7 (with i’ = p). 


2.5 Restoration of beam symmetry after an arbitrary number of reflections 


Suppose an arbitrary sequence of N — 1 reflections 2, +--+ 2j— have 
distorted the initial symmetry of a spherical wave originating from F'p. 
We wish to restore symmetry by introducing an additional reflector 2y. 
Let the principal ray through Fy be reflected N — 1 times by the given 
reflectors as shown in Fig. 1la for N = 3. The reflector 2) must be chosen 
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(b) 


Fig. 10—Central path and equivalent axis of a combination of two ellipsoids. 


so that this ray is one of the two central rays of the sequence 21, +++, Uy. 
This means the path of the ray must close after 2N successive reflections. 
Now a part of this path, the section determined by the reflections of 21, 
Do,°**, N-1, is fixed in advance. Therefore let this part of the central 
ray be determined first. It starts at Fxy—, and, after 2(N — 1) reflections, 
it ends again at Fy—, with direction d as shown in Fig. 11a. Since its final 
direction 4 is given, its initial direction ¢ can be found by tracing the ray 
backwards. Once ¢ is known, the condition that 2j must satisfy is simply 
eq. (12), with y given by the angle between ¢ and Gd, shown in Fig. 11. 


2.6 How to determine the first reflector if the remaining ones are given 


The above argument applies also to the problem where the first re- 
flector, rather than the last, is to be found and the remaining reflectors 
are given. The only difference in this case is that one must use eq. (11), 
instead of eq. (12), as shown by the following example. To consider a 
situation of practical interest, suppose the last reflector 2y is a para- 
boloid as shown in Fig. 11b. Assume that all the reflectors except the first 
one are given and that >; must be chosen so that the central ray passes 
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Fig. 11—(a) How to determine the last reflection if the first N — 1 are given. (b) How 
to determine the first reflection if the last N — 1 are given. 


through the center C of the paraboloid aperture. Then, as in the previous 
problem, one notices that a part of the desired central path is fixed in 
advance. This part starts as F' with direction ¢ and, after 2(N — 1) re- 
flections, it ends at F, with direction a as shown in Fig. 11b. Once a is 
found (by ray tracing), the condition that 2; must satisfy is given by eq. 
(11), with y given by the angle shown in the figure between ¢ and a. 


‘2.7 The first and the last reflector are paraboloids 


Consider first N = 2, in which case eq. (13) with M = M’ = ~ demands 
that the angles of incidence on the two paraboloids be identical, except 
for a difference in sign. For this to happen, the axes of the two parabo- 
loids must coincide, in which case one can show that the two angles of 
incidence coincide for any choice of the principal ray. These remarks 
apply also to N > 2, since the last N — 1 reflectors can always be replaced 
by an equivalent paraboloid. Thus, 


In order that symmetry be preserved, when both 2; and Zy 
are paraboloids, the axis of 2, must coincide with the 
equivalent axis of D9, +--+, Zy, in which case symmetry is 
preserved by any choice* of the principal ray. (15) 


* A little thought shows that there is another case where the central ray is undetermined: 
namely, when the equivalent reflector is a flat plate. 
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Ill. AN APPLICATION 


The most important example of an offset arrangement is perhaps the 
horn reflector,!! an antenna consisting of a horn combined with a par- 
aboloid. The excellent properties of this antenna (negligible return loss, 
very low level of the far sidelobes, etc.) are well known. However, the 
angle of incidence on the paraboloid.is 45 degrees, and this causes in the 
far field a cross-polarized component of about —20 dB in certain direc- 
tions.!! The 45-degree angle of incidence is required to produce a beam 
orthogonal to the feed axis, which is an important requirement* for radio 
relay systems. In this section it is shown, with two examples given in Figs. 
12 and 18, how this requirement can be fulfilled using two or more re- 
flectors satisfying condition (7). In both Figs. 12 and 13, the feed is of 
the type described in Refs. 1 to 4, and therefore the antenna beam is 
essentially free of cross-polarization everywhere (see the last remark in 
the introduction). 

Figure 12 shows two large reflectors, a paraboloid and an hyperboloid, 
arranged to satisfy simultaneously condition (7) and the requirement 
i + p = 90°, without aperture blockage. This arrangement is of the type 
shown in Fig. 8b of Ref. 7. In Fig. 13, three reflectors, a large paraboloid 
3, and two small hyperboloids 2» and 2, are used. This arrangement 
is more compact, and it requires less total reflecting area, than the one 
of Fig. 12. It is thus particularly attractive when the antenna aperture 
is large, i.e., the far-field beamwidth is small. The angle of incidence i 
and the magnification M of the first reflector 2; satisfy condition (7), 
with p given by the angle shown in Fig. 12. To understand the signifi- 
cance of p, replace the last two reflectors 2» and 3 by their equivalent 
paraboloid. According to (3), the axis of this paraboloid is obtained from 
the axis of 23 by reflecting it once, onto Yo, as shown in Fig. 13. Then 2p 
is the angle the central ray makes with this equivalent axis. Note that 
Pp is equal to the angle of incidence on this equivalent paraboloid (not 
shown in Fig. 13). This angle of incidence must satisfy eq. (7). One can 
verify from the figure that 


tan a + mg tan B 


, (16) 
1 — mg tan a tan B 


tan p = 


a and @ being the angles (see Fig. 7a) of the central ray and the axis of 
3 with respect to the axis of Yo, and 


| VoF | _ eat 1 


7 ; (17) 
|VoFo]| e2-1 
€ being the eccentricity of the hyperboloid 22. Also, 
21 = 90° + 26 — 2a, (18) 


* Of course, this is not the only requirement that must be satisfied. Other requirements 
will be discussed in an article being prepared. 
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HYPERBOLOID 5, a Me: 
ae 





Fig. 12—A vertical feed and two reflectors with i + p = 45 degrees producing a horizontal 
beam without symmetry distortion. 


and from eq. (7), solving for M, 
= — (19) 
tani + tan p 
Using eqs. (16) to (19), one can express M directly in terms of a, 8, 
mo. 

An important property of Figs. 12 and 13 is that there is no aperture 
blockage even for relatively large values (as large as 30 degrees) of the 
angular width 0; of the beam radiated by the feed. Another important 
property, to be discussed in a future article, is that, if the feed is slightly 
displaced so as to cause a small angular displacement of the antenna 
beam, the resulting aberrations are very small. This is a consequence of 
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— EQUIVALENT AXIS OF 2, +23 


fe 








Fig. 13—A vertical feed and three reflectors producing a horizontal beam without 
symmetry distortion. 
condition (7), and it implies that several beams can be produced effi- 
ciently by placing several feeds in the focal plane. 


IV. CONCLUSIONS 


The transformation of a symmetrical beam by an arbitrary arrange- 
ment of N confocal reflectors has been studied. It has been shown that 
it is always possible to choose the principal ray (i.e., the axis of the input 
beam) so that symmetry is preserved by the transformation. This is a 
consequence of the principle of equivalence shown in the appendix, ac- 
cording to which an arrangement of several reflectors can always be re- 
placed by a single reflector producing the same transformation. Thus, 
in order that symmetry be preserved, the principal ray must coincide 
with the axis of symmetry of this equivalent reflector, i.e., the equivalent 
axis. A property of the equivalent axis is that the path of a ray having 
initially its direction becomes closed after 2N successive reflections. 
Because of this property, the equivalent axis can be found by a 
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straightforward geometrical procedure if the last reflector is a paraboloid. 
A simple relation [eq. (11) or (12)] has been given for determining the 
angle of incidence and the magnification of the first or last reflector so 
as to guarantee symmetry. In Section III, the problem of modifying the 
horn reflector to eliminate the asymmetry and cross-polarization due 
to the paraboloid has been discussed. Two solutions have been de- 
scribed. 


APPENDIX 


General Properties of a Sequence of N Confocal Reflectors 

The results of this paper are consequences of the principle of equiv- 
alence stated at the beginning of Section II. This principle is now de- 
rived. 

As pointed out in the introduction, the reflectors we consider are el- 
lipsoids, hyperboloids, or paraboloids; let Fo, F1,---, Fn be N + 1 ar- 
bitrary points, let a point source be placed at Fo, and let a sequence of 
N reflectors 21,-++, Z~ be used to successively transform the spherical 
wave from Fy into spherical waves through Fo, Fi, +--+, Fy. The nth re- 
flector, 2,, with its focal points of F,-; and F,, then transforms the 
spherical wave incident from F,,-; into a spherical wave through F,,. 

Draw two spheres S and S’ centered at Fo and Fy. For each point P 
of S, there is, on S’, a corresponding point determined by the ray through 
P. This mapping has the following properties. 

A circle on S’ corresponds to each circle on S. In fact, it is well 
known!2:13 that a circular cone of rays from F,,-1 is transformed by the 
nth reflector into a circular cone of rays through F,,. 

The mapping is conformal,* and therefore two orthogonal curves of 
S are transformed into two orthogonal curves of S’. 

Another property is that, if the point source at Fo is linearly polarized 
and the lines of the electric field E on S are given, then the corresponding 
lines defined on S’ by the above mapping give correctly the lines of E on 
S’. This result is true in general!4 for arbitrary reflectors, not necessarily 
paraboloids, hyperboloids, or ellipsoids. It allows the polarization of S’ 
to be determined straightforwardly once the relationship between cor- 
responding rays through Fy and Fy) is known. . 


A.1 The central rays 


Draw a line through Fo, to cut the sphere S at two antipodal points. 
We show that it is always possible to choose the line orientation so that 
the corresponding points of S’ are also antipodal points. 


* This property is valid in general for an arbitrary wavefront S which is transformed by 
an arbitrary number of reflections (by arbitrary reflectors, not necessarily of the type 
considered here) into a wavefront S’. The mapping determined between S and S’ by the 
rays orthogonal to S (and S’) is a conformal mapping. 
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Let Ly, Lz and My, M2 be antipodal points of S (see Fig. 14; the sphere 
S is not shown). Let L;, L, and Mj, M3 be their corresponding points on 
S’. Through L}, L5 Mj, M2 draw two great circles. The two circles will 
intersect in two antipodal points 0; and 05, as shown in Fig. 14. We show 
that the corresponding points are also antipodal points. In fact, 0; and 
02 are the points of intersection of the two circles of S that correspond 
to the two circles of S’. Since the circles of S contain the antipodal points 
L,1, Lz and My, Mo, they are great circles and therefore their intersections 
0, and 02 are antipodal points. Q.E.D. 

An important significance of the points 01, 02, 0;, and 0. is the fol- 
lowing. Let a ray from Fo be reflected by the sequence of N reflectors 
twice, first in the order 2, Z9,-++, 2~ and then in the reverse order Dn, 
ZnN-1*++, 1. After these 2N reflections, the ray will pass again through 
Fo, but its direction will in general differ from the direction given ini- 
tially, and therefore the ray will not in general follow the same path if 
reflected 2N more times. However, a little thought shows that, since the 
three points 01, Fo, 02 are collinear and so also are 0;F'n09, the path of 
a ray from 0; (or from 02) will become closed after 2N reflections. The 
same observation applies to the ray from 02, which will follow, in the 
opposite direction, the same path of the ray from 0}. 

The path of the rays from 0; and 02 will be called the central path and 
the two rays central rays. This definition is consistent with the one given 
in Section II. As we shall see, there is in general only one central path, 
except when both 2, and 2y are paraboloids (see Section 2.6) or when 
the equivalent reflector is a flat plate [m. = 1 in eq. (21)]. 

The axial ray F'o0; is now chosen as reference axis. Let a particular 
plane through this ray be chosen as reference plane. Consider a particular 
ray from Fo, and let @ be its angle with respect to the axis and ¢ the angle 
its plane makes with the reference plane. After N reflections, both the 
ray in question and the axial ray pass through Fy. Let 0’ be the angle 
between the two rays at F'y, let ¢’ be the angle their plane makes with 
an arbitrary reference plane (chosen through the axial ray). We wish to 





\ 
SPHERE S’ 


Fig. 14—How to determine the central rays. 
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show that 
b = +6 + d0 (20) 


and 
0’ 0 
tan —- =m, tan-, 21 
A e tan, (21) 


where m, is a constant determined by the N reflectors and ¢o is deter- 
mined by the orientation of the two reference planes which will be chosen 
so that 


go = 0. (22) 


A.2 Derivation of eqs. (20) and (27) 


First consider on S a great circle, through the two axial points 0; and 
Oo, given by 


g =a, (23) 


where a is a constant. Since the corresponding circle on S’ must pass 
through 0; and 04, it is a great circle, given by 


fp’ =a’, (24) 


where a’ is a constant. This shows that ¢’ depends only on @, not on @. 
We now recall that the mapping of S’ must be conformal and therefore 
the angle between two circles through 0; must equal the angle between 
the corresponding circles of S. This implies eq. (20). 

Next we derive eq. (21). Since the sign in front of ¢ in eq. (20) depends 
on the definition of ¢’, and can therefore be chosen arbitrarily, we choose 
for the following derivation 

¢’ = >. 
Since a circle 6 = constant is orthogonal to a circle ¢ = constant, the 
corresponding circles on S’ must be orthogonal. This implies # is a 


function of 6 only. To determine this function, consider on S three points 
of coordinates: 


(6,6), (0+d6,b), (0,6+d¢). 
Let 


(0,6), (0 + d0’,6)(0',6 + dg) 


be the corresponding coordinates on S. Let dé; and dé_ denote on S the 
distances of the first point from the other two. Then 


dé,=rd6, dé .=rsin 6d¢, (25) 


r being the radius of the sphere S. Similarly, for the corresponding dis- 
tances on S’, 


dé,=r'd0’, dé,=r'sin 6’dd. (26) 
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Since the mapping is conformal, one must have 


dey _ dey 

dl, dbs’ 
which gives the condition 

dé av’ 








ar can : (27) 
sin@ sini’ 


Integrating this gives eq. (21), where m, is a constant of integration. 
When N = 1, eqs. (20) and (21) are nothing new. In fact, then the re- 
flector system reduces to a single reflector whose eccentricity determines 
the parameter m,. When N > 1, eqs. (20) and (21) show the N reflectors 
are equivalent to a single reflector with eccentricity specified* by mg. 
A.3 Derivation of eqs. (11) and ( 12) 
Consider the ellipsoid shown in Fig. 15. Then 
tan a tana’ =1 (28) 
and 
tan o’ tan’ = tana tany =m, 
where 
_ |FoVol 
[FiVol 
Therefore, taking into account that y = 90° —y—Y, 
1 — tan? y tan? a 
tan ¥(1 + tan? a) 
Also, i = 90° — a — y, and therefore 
1 —tanatany 


any = ee (31) 
tan a + tan py 


Now the magnification M of J is defined as 


M=- [IF ol (32) 
[IF | 
and from Fig. 15 is related to the angles y and a, 
sin 2a 
_tany 1+ tan’a 


tana 1+ tan? yp’ 


(29) 


tan y = (30) 


(33) 


* The value of m, can be calculated using the formula 
Me = N; 
where M,, ---,Mwn are the magnifications calculated for the N reflections of the central 
ray chosen a as reference axis. The sign of me depends on the sign convention for ¢ in eq. 
(20). 
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Fig. 15—Two successive reflections by a concave ellipsoid. 


which gives 


M__ tan (1 + tan? a) (34) 
M-1 (tany+tana)(1+tanytana) 


From eggs. (80), (31), and (34), one obtains eq. (11). The derivation of eq. 
(12) is entirely analogous. The case where the reflector is convex, or is 
a hyperboloid, can be treated in the same way. 
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Radiation Patterns from Parallel, Optical 
Waveguide Directional Couplers—Parameter 
Measurements 
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A new method for measuring the parameters of optical, parallel, 
waveguide directional couplers is presented. Basically, we observe the 
changes in radiation pattern obtained by placing a high, refractive, 
index coupling prism on the coupled guides as a function of position 
along the coupler. For a coupler, ina Z cut, Ti-diffused LiNbO3 sub- 
strate with 3-um guides and 3-um separation, the transfer length is 
about 1.8 mm at 7266 A. 


1. INTRODUCTION 


Parallel coupled waveguides are the basic building block for a number 
of integrated optical devices; these include switches,!-§ modulators, and 
channel dropping or adding filters.’ The techniques used to measure 
coupling parameters are often visual in nature. The simplest approach 
is to observe the energy exchanges between the parallel guides from the 
surface scattering of these guides viewed through a microscope. However, 
this is not always feasible; e.g., operation at longer wavelengths away 
from the visible, with low-loss surface scattering guides, and in cases 
where the energy at the surface is rather low, as it happens with Ti dif- 
fused guides in LiNbOs3. In such cases, the technique developed by Os- 
trowsky et al.® is quite useful. They observed the fluorescence from 
RhB-doped polyurethane film over the strip guides pumped by an argon 
laser. 

In this paper, we present a method found useful in measuring the 
parameters of such couplers. Basically, the method consists of obser- 
vation of the interaction length dependence of the coupling via radiation 
pattern measurements;? the radiation patterns are obtained by moving 
an output coupling prism along the coupled waveguide region. 


2685 


ll. THEORY 
2.1 Synchronous couplers 


Figure 1 depicts two coupled parallel waveguides where a is the guide 
width, c is the guide spacing, and L is the length over which the guides 
are coupled; i.e., the interaction length. We consider the ideal case which 
assumes that the guides are identical in width and thickness so that 
perfect synchronism of the unperturbed propagation constants exists; 
for this case, the normalized field amplitudes in the two guides as a 
function of length z can be shown to be!® 

R cos KZ (1) 

S =] sin xz, 
where R is the field amplitude in the initially excited guide, S is that of 
the auxiliary guide, and «x is the coupling strength per unit length. We 
are interested in determining the coupling strength x per unit length for 
a coupler of known physical parameters; knowledge of x permits the 
selection of L for a coupler of desired overall coupling strength. If, at 
some point z along the parallel coupled region, we place a prism whose 
refractive index is higher than that of the waveguides, then power will 
be radiated from the two waveguides. Thus in the far field we observe 
a radiation pattern due to the interference of the fields from the coupled 
waveguides over the coupling length of the prism coupler. If we keep the 
prism coupling length small compared to 1/k, say, less than a millimeter, 
then the far-field radiation pattern would truly be representative of the 
pattern from two slits separated by a distance of d having relative am- 
plitudes given by eq. (1). 

If we assume constant transverse field amplitudes, as seen from Fig. 
2, the expression for the radiation pattern is 


\E|?= 





cane 
— (1 + sin 2Z sin 2u “) ; (2) 
a 


where 
Z= Kz 


Ta , 
u = — sin 9 
ON 


Fig. 1—Parallel waveguide directional coupler where a is guide width, c is the guide 
spacing, and L is the interaction length. 
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SYNCHRONOUS COUPLER 
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Fig. 2—Radiation field amplitudes under far-field conditions due to a pair of sources 
of width a, separated by a distance d. 6 is measured in the plane perpendicular to the plane 
containing the waveguides. 


d=ate 
6 is radiation angle. 


Figure 3 shows computed plots of |#|? as a function of u for the case d/a 
= 2 with Z as the parameter. Except when all of the energy is in one 
guide, e.g., at Z = o, the radiation pattern is asymmetrical about 6 = o. 
This is true even for the case when Z = 7/4, when the field amplitudes 
in both guides are equal, and differ by a phase shift of 90 degrees. When 
Z is increased from 7/4 to 7/2 in specific increments, the patterns remain 
the same as Z is varied from 7/4 to 0, for the same shape, i.e., for example, 
identical patterns are observed for the cases when Z = 7/16 and 77/16, 
a/8 and 37/8, 37/16 and 5z/16, etc. At Z = Zo = 7/2, complete energy 
transfer occurs. When Z is varied from z/2 to 37/4 and back to z, the 
graphs shown in Fig. 3 can be used with change in sign of abscissa. The 
whole series of patterns repeat themselves in this manner with increasing 
Z. 


2.2 Asynchronous couplers 


If the waveguides differ in width, thickness, or refractive index, their 
propagation constants will differ. This could occur as a result of errors 
in the fabrication process. For such asynchronous couplers, complete 
power transfer from one guide to the other is not possible. If we define 
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Fig. 3—Computer plots of energy distribution as a function of normalized radiation angle 
u for the case d/a = 2, with Z = xz as the parameter. 


the difference in unperturbed propagation constants in the two guides 
as AG, then the normalized field amplitudes® as a function of z be- 
come 


Pies pa ee Seer 1 
R’ = cosa lee 
sin a 
Sat ep (3) 
Vata 
where 
y = AB/2k 
a= Vy? +1 Kz. 


Here, again, R’ is the field amplitude in the initially excited guide and 
S’ is that of the auxiliary guide. With these field amplitudes, the radia- 
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tion pattern is given by 


no _sintu 7, sin Q(y?+1)?Z . 7, 
|E’|2 = = ( (2+ DV sin (2u=) 
d 
= a 7 [1 — cos 2(y2 + 1)!/2Z] cos (2u ")). (4) 


The power in the coupled guide is obtained by squaring eq. (3) and is 
given by 
sin?[(y? + 1)1/2Z] 
(y? + 1) 
and (5) 
[R’]2=1—-|S’]2. 


Is?= 


We find the maximum value for the coupled power to be (y2 + 1)7! 
at Z = (mz/2)(y? + 1)-!/2. Plots of |E’| show the expected result that 
the information content in the radiation patterns decreases rapidly with 
increasing asynchronism. However, useful information is obtained by 
recognizing the transfer period as indicated by all the power being 
present in the input guide. 


lil. COUPLER FABRICATION AND MEASUREMENT TECHNIQUE 


The procedures used in the fabrication of the experimental couplers 
are described. Z-cut lithium niobate substrates were coated with poly- 
methyl-methacrylate (PMMA) electron resist approximately 0.5 micron 
in thickness. A thin layer of aluminum (100 A) is evaporated onto the 





Fig. 4—Guide tracks defined in PMMA after electron beam exposure and development. 
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Fig. 5—Illustration of the setup to measure the coupler radiation pattern. 


PMMA to eliminate charging problems. The coated substrate is then 
mounted onto a scanning electron microscope (SEM) stub using a con- 
ducting silver paste. Using the appropriate scan generator, the first guide 
of the coupler is exposed. The scan generator output amplitude is then 
attenuated and the writing beam moved by electronic adjustment of the 
fine shift coil current; the auxiliary guide is then exposed. For exposure, 
a specimen current of 10-9 A is typically used with an exposure time of 
about 25 s to obtain 3-um wide guides 15 mm in length. The sample is 
then removed from the SEM. A brief soak in dilute NaOH removes the 
aluminum layer. The PMMA is then developed for about 30s in a 3-to-1 





Fig. 6—A typical radiation pattern—in this case, the energy is very close to the position 
where all the energy is one of the guides. 
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Fig. 7—Radiated power output as a function of normalized radiation angle u where all 
the energy is in one of the guides, z = 0. d/a = 2 for this coupler. 


mixture of isopropyl alcohol and methylethyl ketone. The guide tracks 
are now defined in the PMMA (Fig. 4). The sample is blown dry with dry 
nitrogen and mounted in a sputtering system for deposition of a Ti layer 
usually about 300 A thick. The PMMA and excess Ti are next removed 
by soaking the sample in acetone. At this point, we have a sample with 
Ti where we want the waveguides. The sample is next placed in an oven 
and brought to 1000°C in an argon ambient. Following the 1000°C 
soaking for about three hours, the furnace is turned off and the ambient 
changed to oxygen. The resulting guides exhibit single TE mode opera- 
tion. 

The experimental set-up used to measure the coupler radiation pat- 
tern is shown in Fig. 5. The lasers employed were He-Ne operating at 
6328°A and a nile-blue dye laser covering the wavelength 6900°A to 
7500°A. The latter source was pumped by the 6471°A line of a krypton 
laser. The prisms were made of rutile. The input prism was quite flat, 
allowing strong coupling, whereas the base of the output prism had a 
curvature in it to ensure the coupling region to be much less than that 
of a millimeter. Although the amount of energy coupled out is rather 
small, the resulting radiation pattern is primarily due to the energy of 
the guide at the output prism location and does not include the effects 
of long coupling lengths. As the output prism was moved along the 
guides, the radiation pattern was scanned using an iris. Figure 6 is a 
photograph of a typical nearly synchronous coupler radiation pattern. 
The pattern in Fig. 7 resulted from a coupler operating at 7266°A con- 
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Fig. 8—Radiation pattern of the same coupler shown in Fig. 7, but at a different z ~ 
57/8, past the location of complete energy transfer. 


sisting of 3-um guides. The measurement was made ata position Z = 0 
along the coupler. Where all power was essentially in one of the guides, 
measured distribution agrees well with the theory. By moving the output 
prism to a place where all the energy is in the other guide, the transfer 
length can be measured. However, if the prism is not placed exactly at 
this location, one can infer this information by noting the nature of the 
asymmetry and measuring the radiation pattern. For example, Fig. 8 
shows the output radiation pattern for the same coupler, but at a dif- 
ferent longitudinal position z = 2.25 mm. In this case, the power in the 
two guides is nearly equal, resulting in sidelobe development in the ob- 
served radiation pattern. From Fig. 3 for d/a = 2.0, the separation Au 
between minima is 1.6. At 7266°A, for a = 3 um, this translates into a 
separation Ax = 2.24 cm between minima at a distance y = 18cm from 
the output prism coupling position. This compares favorably with the 
measured value of 2.25 cm. By a series of observations on this coupler, 
we can infer an interaction length for full power transfer L, = 1/2k; the 
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best fit for curve in Fig. 8 occurs at Z = 7z/2L, = 57/8, from which the 
transfer length L, is inferred to be 1.8 mm for this coupler. The sepa- 
ration of the minima agrees very well, although the peaks do not. Con- 
sidering that we analyze uniform distribution of energy in the wave- 
guides, the agreement is rather good. 


IV. CONCLUSION 


We have described a method for measuring the coupling strength of 
synchronous optical waveguide directional couplers by observing the 
length dependence of the radiated signal. As indicated earlier, the 
technique is useful, with care in implementation, as a laboratory. tool. 
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Speech signal power at the main distributing frame in class 5 
switching offices is characterized in terms of equivalent peak level (EPL) 
and average conversational signal power measures. The results indicate 
that there is little dependence of speech signal power on call destination 
or originating class of service. Small differences between various sub- 
populations are explained for the most part by loop characteristics. 
The switched telecommunications network is essentially transparent 
to customers in the sense that talker signal power has not been found 
to be sensitive to factors which affect the transmission path between 
class 5 central offices. 

Present-day speech volumes for toll calls, which average —21.6 VU 
(volume units), are substantially lower than those found in a survey 
conducted in 1960,! which averaged —16.3 VU, and the ranges of vol- 
umes within all call destination categories are substantially smaller 
than the 1960 ranges. Several substantial changes have been introduced 
into the telephone plant since 1960 which tend to increase the unifor- 
mity of service in the network from the viewpoint of speech volumes. 
These include a decrease in the proportion of toll grade battery, loss 
plan improvements, replacement of the 300-type telephone set with 
the 500-type set, and an increase in direct trunking between class 5 
offices. 


I. INTRODUCTION 


The characterization of speech signal power on Bell System message 
circuits is an essential step in the determination of signal power loading 
and crosstalk objectives. Knowledge of speech signal characteristics is 
also important to designers of a wide variety of telecommunications 
equipment. 

Speech levels at the class 5 office were last characterized in the 1960 
Speech Volume Survey! in terms of volume units (VU). In the years since 
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the last survey, there have been substantial changes in the Bell System 
network. For example, the proportion of toll grade battery has been 
substantially reduced, the 300-type telephone set has been almost 
completely phased out, direct distance dialing is now virtually universal, 
and new loop and trunk design methods have been introduced. Also, in 
the intervening years, research in speech signal measurement has led 
to a new measure of speech level known as the equivalent peak level 
(EPL).? This, together with advanced digital data acquisition technology, 
has facilitated the measurement of speech signal power with greater 
precision than was possible in 1960. 

This paper presents the results of a speech signal power survey made 
in 1975-1976. The measurements were made at 36 class-5-office main 
distributing frames (MDFs), which constitute a statistical sample of ac- 
ceptable precision from all the MDFs within the Bell System. The class 
5 (local or end) office MDF was selected as the measurement interface 
because it has access to all customer loops and all classes of local and toll 
traffic; dialed address information is readily available; only the cus- 
tomer’s loop and station equipment is interposed between the customer 
and the point of measurement; and the customer’s loop current may be 
measured. A three-stage statistical sampling scheme was employed, 
which resulted in measurements of near-end and far-end talker power 
on more than 10,000 calls originating from approximately 2500 loops. 
Average conversational signal power (averaged over the entire obser- 
vation interval) and EPL were the measures used for talker signal char- 
acterization. Loop dc current, class of service, switch type, and call 
destination were also recorded. 

Survey results are presented in Section II, the methodology is pre- 
sented in Section III, and comparisons of the present survey results with 
prior survey results are given in Section IV. 


il. SURVEY RESULTS 


Table I summarizes the findings of this survey. The results indicate 
that there is little dependence of speech signal power on call destination 
or originating class of service. In the sections that follow, it is shown that 
the small differences between various subpopulations are explained for 


Table I—Summary of speech signal powers 


Near-End Mean 
Equivalent Peak Level 
(dBm) 


Subclass 
Residence -11.0 
Business —10.4 
Local —10.8 
Toll —10.1 
Combined —10.7 
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the most part by loop characteristics, and there is little if any variation 
in speech signal power that may be attributable to psychological factors 
such as call distance, perception of received volume, etc. The indication 
from the data is that the switched telecommunications network is es- 
sentially transparent to customers in the sense that talker signal power 
has not been found to be sensitive to call distance, local or toll call clas- 
sification, or other factors that affect the transmission path from class 
5 to class 5 central office. 


2.1 General 


In this survey, speech signal power measurements were made on 
customer loops at class 5 switching office main distributing frames 
(MDFs) during actual telephone conversations. The parties originating 
calls on sampled loops are referred to as the “near-end” speakers in the 
following discussion; the called parties are referred to as the “far-end” 
speakers. The far-end speakers were more distant than the near-end 
speakers from the MDFs at which the measurements were made, except 
for some intrabuilding calls. 

The survey results characterize near- and far-end speech signal powers 
in terms of the equivalent peak level (EPL) and average conversational 
signal power measures, which are discussed in Section 3.3.3. The dif- 
ferences are also characterized between near- and far-end signal powers 
and between the EPL and average power measures. In addition, the in- 
fluences of loop current, originating class of subscriber service, call 
destination, call distance, originating switch, and demographic features 
upon speech signal powers are investigated. 


2.2 Speech signal powers at main distributing frames 


The distributions of speech signal power at main distributing frames 
can be approximated by normal distributions. Histograms and cumu- 
lative distribution functions (CDFs) are given for the EPL and average 
power measures of speech signal power for the near- and far-end speakers 
in Figs. 1 through 4. The “bell” shapes of the histograms and the straight 
line shapes of the CDFs, which are plotted on normal probability grids, 
attest to the normality of these distributions. Because of this, the dis- 
tributions are completely defined by the means and standard deviations 
listed in the first four lines of Table II. 

While the near- and far-end signals encounter similar populations of 
station set and subscriber loop losses, the far-end signals also encounter 
end-office-to-end-office transmission losses. As a result of these addi- 
tional losses, which will be referred to as the “apparent network loss” 
during the remainder of this paper, the average far-end signal power is 
generally lower than the average near-end signal power. The apparent 
network loss is a function of call destination, i.e., the greater the call 
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Fig. 1—Near-end equivalent peak level (dBm) distribution. 
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Fig. 2—Far-end equivalent peak level (dBm) distribution. 


distance between end offices the more the signals are attenuated. This 
source of variation explains the greater variability among the far-end 
signal powers. These near-end, far-end differences exist for both EPL 
and average power; however, a comparison of the near- and far-end EPL 
results gives a difference of 2.1 dB, while a similar comparison for the 
average power measures gives a difference of 2.9 dB. In the following 
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Fig. 3—Near-end average signal power (dBm) distribution. 
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Fig. 4—Far-end average signal power (dBm) distribution. 


paragraph, this apparent anomaly is shown to be caused by a difference 
in the speech activity of near- and far-end speakers. 

The EPL, which is derived from the speech samples exceeding a 
threshold, is a measure of the speaker’s peak signal power, and therefore 
is unaffected by silent periods in the conversation. The average signal 
power for conversational speech, however, includes intervals of speech 
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and silence alike. Therefore, the average power measure is lower than 
the corresponding EPL. This is illustrated by the results in Table I, 
which show that the average difference between EPL and average power 
is 14.6 dB for the near-end measures and 15.6 dB for the far-end mea- 
sures. Such differences represent activity factors in the sense that they 
are logarithmically related to the amount of silence during a conversa- 
tion.? They indicate that calling parties (near-end) tend to speak more 
than called parties (far-end) during telephone conversations. Due to 
these different speech activity characteristics, the apparent network loss 
result based upon average power is overestimated by about 1 dB. This 
finding explains the apparent anomaly noted above, and suggests that 
EPL is more appropriate than average power for estimating apparent 
network loss. 

Comparisons of near-end EPL and average power with the far-end 
measurements are provided in the scatter diagrams in Figs. 5 and 6. The 
correlation coefficients are 0.31 and 0.57 for the EPL and average power 
comparisons, respectively. While the relationships are statistically sig- 
nificant, the modest positive correlations indicate that the signal power 
of one speaker is not strongly influenced by the signal power of the 
other. 

Average signal power is strongly related to EPL. The results of the 
linear regressions of the near- and far-end EPLs on the corresponding 
average powers are given in Figs. 7 and 8, respectively. The near-end 
regression shows that average power = —14.27 + 1.04 EPL, and the far- 
end regression shows that average power = —15.40 + EPL. The values 
of R2, the square of the correlation, on the figures indicate that ap- 
proximately 85 percent of the variation in average signal power is ac- 
counted for by the regression fits. 

Signal power at the MDF is dependent upon loop loss and the tele- 
phone set electroacoustic efficiency. While these parameters were not 
measured, the near-end loop current, which was measured, has been 
found to relate to the overall loop and telephone set loss.* The histogram 


Table II—Systemwide speech signal power results 


Transmission Characteristic Mean 90%C.I. Std.Dev. Sample 
Near-end EPL (dBm) —10.7 +0.5 4.6 10251 
Far-end EPL (dBm) —12.7 +0.4 5.2 8976 
Near-end average power (dBm) —25.3 +0.5 5.3 10251 
Far-end average power (dBm) —28.3 +0.4 5.6 8976 
Near minus far-end EPL (dB) 2.1 +0.4 5.9 8478 

- Near minus far-end average power (dB) 2.9 +0.4 6.7 8478 
Near-end EPL minus average power (dB) 14.6 +0.1 2.1 10251 
Far-end EPL minus average power (dB) 15.6 +0.1 2.1 8976 
Near-end loop current (mA) 42.2 +1.9 12.8 10749 


90% C.I. = 90-percent confidence interval for the mean estimate. 
Std. Dev. = Standard deviation of the signal power or loop current population. 
Sample = Total sample size in calls used to calculate estimates. 
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Fig. 5—Comparison of near- and far-end equivalent peak level. 


and CDF for loop current are given in Fig. 9. The distribution is positively 
skewed, which means that it deviates from normality due to some large 
values of loop current associated with short loops. The distribution also 
deviates from normality at the lower tail because of a truncation of loop 
currents below 20 mA due to engineering limitations for signaling and 
transmission systems. Table II shows that the average loop current is 
42.2 mA and the standard deviation is 12.8 mA. 

Near-end EPL and average power are plotted as a function of loop 
current in Figs. 10 and 11, respectively. The scatter diagrams indicate 
that EPL and average signal power increase as loop current increases. 
Loop and telephone set characteristics suggest that a nonlinear rela- 
tionship exists between loop current and the total loop and telephone 
set loss.4 Nonlinear regression confirms this; however, the improvement 
in fit over the linear model, while statistically significant, is not of 
practical interest. The linear regressions of EPL and average power on 
loop current indicate that signal power increases about 0.13 dB per 1.0 
mA increase in loop current. However, signal power varies substantially 
about the regression lines, indicating that loop current alone is not a good 
predictor of signal power. Visually, the variance appears to depend upon 
loop current; however, an analysis within loop current categories indi- 
cates that the variance is constant. 
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Fig. 6—Comparison of near- and far-end average power. 


A more vivid illustration of the relationship between signal power and 
loop current is given in Fig. 12 by plotting the average EPL for each of 
the 36 MDFs in the sample as a function of the average loop current per 
MDF. The scatter shows a positive correlation, and the correlation 
coefficient is 0.82. A linear regression indicates that average EPL = 
—19.06 + 0.20 average loop current, and that the regression fit accounts 
for 67 percent of the variability in average EPL among MDFs. 


2.3 Signal power and class of service 


Class of service identifies the subscriber as a business or residential 
customer and identifies the station terminals as Bell or customer-pro- 
vided equipment (CPE). The analyses discussed in this section deal with 
these service perspectives on the basis of originating class of service. The 
terminating customer class of service was not determined for the calls 
in this survey. 


2.3.1 Business versus residential 


The survey results for business- and residential-originated calls are 
summarized in Table III. Comparisons of the near-end EPL and average 
power results indicate that business-associated signal powers tend to 
be slightly higher than residential-associated signal powers, and that 


2702 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1978 


AVERAGE POWER = —14.27 + 1.04 EPL 
R2 = 0.84 
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Fig. 7—Linear regression of near-end average power on equivalent peak level. 


the variability among signal powers is about the same in both service 
categories. The 90-percent confidence intervals for the business and 
residential averages overlap, indicating that the differences are not 
statistically significant. Business loop currents are significantly higher 
and more variable than residential loop currents. The 5.3-mA difference 
in average loop current combined with the finding in Section 2.2, which 
indicates that EPL increases 0.13 dB per 1.0 mA increase in loop current, 
suggests that the business average EPL should be about 0.7 dB higher 
than the residential average. This difference agrees with the residence- 
business difference found for the near-end talker. 

The far-end signal power results derived from the analysis by origi- 
nating class of service are almost identical in the business and residential 
classifications. Since the originating parties in either category place calls 
to business and residential stations alike, the far-end speakers in each 
originating class of service category represent a mixture of business and 
residential customers. The far-end class of service mixture within each 
originating class of service category is sufficiently close to the overall 
traffic composition that the far-end results in each category are essen- 
tially the same as the far-end results for all telephone traffic listed in 
Table II. It is interesting to note that, although the average calling dis- 
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Table IlI—Originating class of service speech signal power results 
Business Residential 
Transmission Characteristic Mean 90% CI. Std.Dev. Sample Mean 90% C.I. Std.Dev. Sample 
Near-end EPL (dBm) —-10.4 +£0.7 4.6 6072 —11.0 +0.4 4.7 4179 
Far-end EPL (dBm) —12.8 +0.5 5.2 5228 —12.7 +0.4 5.2 3748 
Near-end average power (dBm) —25.0 +0.8 5:2 6072 —25.7 - £0.4 5.4 4179 
Far-end average power (dBm) —28.4 +0.5 5.5 5228 —28.2 +0.4 5.6 3748 
Near minus far-end EPL (dB) 2.5 +0.7 6.0 4916 L.7 +0.3 5.8 3562 
Near minus far-end average power (dB) 3.4 +0.7 6.7 4916 2.5 +£0.3 6.6 3562 
Near-end EPL minus average power (dB) 14.6 +0.2 21 6072 14.7 +0.1 2.1 4179 
Far-end EPL minus average power (dB) 15.7 +0.1 2.1 5228 15.5 +0.1 2.1 3748 
Near-end loop current (mA) 45.0 +2.9 13.9 6384 39.7 +1.2 11.1 4365 





AVERAGE POWER = —15.40 + EPL 
R2 = 0.86 
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Fig. 8—Linear regression of far-end average power on equivalent peak level. 


tance for the business-originated calls (50 + 12 miles) is over 3.5 times 
the average for the residential calls (14 + 4 miles), there is no noticeable 
call distance impact upon far-end talker received signal power. This does 
not imply that call distance has no influence upon network loss; it does 
imply that most of the data represent local calls or very short toll calls, 
and thus any potential call distance influence is not apparent. 

Speaker speech activity during a telephone conversation is not affected 
by originating class of service. The EPL-average power differences have 
similar distributions for business- and residential-originated conver- 
sations. 

The signal power distributions are all close to normal for business and 
residential calls. Therefore, the EPL and average power results listed in 
Table III completely define the signal power distributions for all practical 
applications. The business and residential loop current distributions 
differ significantly and are presented in Figs. 13 and 14, respectively. 
The business loop current distribution is comparable to the 1964 General 
Loop Survey” computed loop current distribution. The residential dis- 
tribution has a greater proportion of lower current loops than the 1964 
Survey result. 
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Fig. 9—Near-end loop current (mA) distribution. 


2.3.2 Bell versus customer-provided equipment 


Business calls are further classified on the basis of terminal equipment 
ownership in this section. One category contains those business calls 
which originated from subscriber lines with terminal equipment leased 
from the Bell System, and the second category contains those calls which 
originated from subscriber lines with customer-provided equipment 
(CPE). The results of this analysis are tabulated in Table IV. The near- 
end estimates show that the Bell signal powers on the average are more 
than 2 dB higher than the CPE signal powers, and that they are also 
somewhat less variable. The reason for this difference is suggested by 
examining the relationship between loop current and EPL for Bell and 
CPE loops, respectively. The correlation coefficients are 0.39 and 0.16, 
respectively, indicating that speech signal power on CPE loops is less 
strongly influenced by loop current than in the case of Bell loops. The 
reason for this is that the CPE station equipment battery is provided by 
-a local power supply and not over the metallic loop facility. Thus, the 
electroacoustic efficiency of CPE station equipment is unrelated to the 
loop current observed in the central office, and the lower mean and 
higher variance in signal power may be attributable to the various local 
battery supplies and electroacoustic efficiencies of CPE terminals. 

Comparisons of the far-end signal power estimates indicate that those 
far-end signals associated with CPE-originated calls have slightly lower 
signal powers than those associated with Bell-originated calls. The ab- 
sence of detailed information about the far-end customers prevents 
further analyses to determine the cause of this difference. 


2706 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1978 


NEAR-END EPL IN dBm 


EPL = —16.26+ 0.13 LC 
R2 = 0.12 





at 
10 30 50 70 90 
NEAR-END LOOP CURRENT IN mA 





Fig. 10—Linear regression of equivalent peak level on loop current. 


The signal power distributions again are all close to normal for both 
Bell- and CPE-originated business calls. The loop current distributions 
for both categories are comparable to the distributions given in the 
previous section for business calls in general. 


2.4 Signal power and call destination 


Four categories of call destination are considered in the following 
discussion; (i) intrabuilding local calls, (iz) interbuilding local calls, (iiz) 
Home Numbering Plan Area (HNPA) toll calls, and (tv) Foreign Num- 
bering Plan Area (FNPA) toll calls. The first two of these categories 
characterize local calls, and the last two characterize toll calls. 

The trend lines in Fig. 15 summarize the relationships between signal 
power and call destination and between loop current and call destination. 
The near-end EPL and average power appear to increase slightly as the 
call destination becomes more remote from the originating office, with 
the exception of a slight drop in signal power for interbuilding local calls. 
The 90-percent confidence intervals for the four EPL estimates and for 
the four average power estimates overlap, which indicates that the dif- 
ferences among categories are not statistically significant. About half 
of the increase or decrease in signal power can be attributed to the call 
destination trend for loop current, which is plotted at the bottom of 
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Table |!V—Bell and customer-provided equipment speech signal power results 


___—s*iBell Business CPE Business 
Transmission Characteristic Mean 90% C.I. Std.Dev. Sample Mean 90% C.I. Std. Dev. Sample 
Near-end EPL (dBm) —-10.4 +0.7 4.6 2857 —12.5 +1.2 5.1 2552 
Far-end EPL (dBm) —-12.8 +0.5 5.2 2404 —14.0 +0.5 5.1 2228 
Near-end average power (dBm) —24.9 +0.7 5.2 2857 —27.5 +1.3 5.4 2552 
Far-end average power (dBm) —28.4 +0.5 5.5 2404 —29.3 +£0.4 5.1 2228 
Near minus far-end EPL (dB) 2.5 +0.7 6.0 2304 1.8 $1.3 6.2 2065 
Near minus far-end average power (dB) 3.4 +0.7 6.7 2304 1.6 +1.6 6.6 2065 
Near-end EPL minus average power (dB) 14.6 +0.2 2.1 2857 15.1 +0.1 2.4 2552 
Far-end EPL minus average power (dB) 15.7 +0.1 2.1 2404 15.3 +0.2 2.0 2228 
‘Near-end loop current (mA) 45.1 +2.9 14.0 2957 37.8 $3.2 11.3 2715 
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Fig. 11—Linear regression of average power on loop current. 


Fig. 15. The correlation coefficients for near-end EPL and loop current 
are between 0.30 and 0.40 for all four destination categories. As the loop 
current decreases or increases, the EPL and average power trend lines 
follow. Since loop currents tend to be higher for business-originated calls 
(Section 2.3.1) and since the HNPA and FNPA categories of calls have 
increasingly more business-originated traffic (intrabuilding: 34 percent, 
interbuilding: 50 percent, HNPA: 59 percent, and FNPA: 69 percent), loop 
current tends to increase as the call destination becomes more remote. 
Interbuilding local calls, however, present an exception to this behavior 
which is not understood. It may be a real deviation from the overall trend, 
or it may be a random statistical phenomenon. Since the trends are so 
slight, further investigation of the interbuilding results is not warrant- 
ed. 

Examination of the near-end EPL and average power distributions 
within the individual call destination categories shows that they are close 
to normal in all categories except the FNPA category. In the FNPA cate- 
gory, both distributions modestly deviate from normality in the upper 
10-percent tail due to a truncation of EPL around 0 dBm and a truncation 
of average power around —15 dBm. The reason for this truncation is not 
known; however, it represents a threshold above which speakers rarely 
drift. In the other call destination categories, 0 and —15 dBm signal 


SPEECH SIGNAL POWER 2709 


AVERAGE NEAR-END EPL PER MDF IN dBm 


AVERAGE EPL = —19.06 + 0.20 AVERAGE LC 
R2 = 0.67 





32 36 40 44 48 52 56 60 
AVERAGE NEAR-END LOOP CURRENT PER MDF IN mA 


Fig. 12—Linear regression of equivalent peak level on loop current using MDF aver- 
ages. 


powers fall in the highest 1 percent of the EPL and average powers, re- 
spectively. The distributions for far-end EPL and average power are 
essentially normal in all categories. 

The far-end signal powers tend to decrease as the call destination 
becomes more remote from the originating office due to increases in 
end-office-to-end-office network transmission loss. In the case of in- 
trabuilding local calls where both parties are served by the same local 
switching office, the only additional network loss encountered by far-end 
signals is the switching office loss itself. As a result, the near- and far-end 
signal powers differ only slightly for intrabuilding local calls. These 
differences increase for interbuilding local calls and HNPA calls, which 
have similar far-end signal powers, due to an increase in the number of 
switching offices and trunks involved in the transmission path and the 
via net loss design® adopted for these arrangements of facilities. Likewise, 
an even greater difference between near- and far-end signal power is 
observed in the FNPA category. The detailed statistics associated with 
the trends illustrated in Fig. 15 are listed in Table V. 

The correlation between near- and far-end signal powers also appears 
to depend upon call destination. A comparison of near- and far-end EPL 
provides correlation coefficients of 0.36, 0.27, 0.28, and 0.14 for intra- 
building, interbuilding, HNPA, and FNPA calls, respectively. The cor- 


2710 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1978 


0.995 ee 
0.990 


0.970 
0.950 


0.900 
0.850 
0.800 
0.700 
0.600 
0.500 
0.400 
0.300 
0.200 
0.150 
0.100 


PROBABILITY 


0.050 
0.030 



































































































































0.010 | 


0.005 os 
10 30 50 70 — 90 10 30 50 70 90 


Fig. 13—Near-end loop current (mA) distribution for business. 
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Fig. 14—Near-end loop current (mA) distribution for residential. 


relation becomes poorer as the call destination becomes more remote 
because of the overall increasing and opposite impacts of network 
transmission loss and loop current on far-end and near-end signal 
powers, respectively. 

The intrabuilding and interbuilding local call data were pooled to 
obtain overall local results, and the HNPA and FNPA data were pooled 
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Fig. 15—Relationship of call destination to signal power and loop current. 


to obtain overall toll results. Briefly, near-end toll signal powers are 
slightly, but not significantly, higher than near-end signal powers for 
local calls, and far-end toll signal powers are significantly lower than 
far-end powers for local calls. The reasons for these characteristics are 
discussed above. The only additional observation at this point is that 
the local loop current distribution resembles the residential distribution 
in Fig. 14 and the toll loop current distribution resembles the business 
distribution in Fig. 13. The dominance of residential and business traffic 
for local and toll calls, respectively, is responsible for these similari- 
ties. 


2.5 Additional speech signal power analyses 


The signal power data were also analyzed to determine the impact of 
call distance, local switch type, and several demographic factors upon 
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Table V—Call destination speech signal power results 
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speech signal power. Call distance is defined as the airline distance be- 
tween the originating and terminating local switching machines. Near- 
end signal power and loop current do not appear to be correlated with 
call distance. Far-end signal power is weakly correlated with call distance 
in a negative sense, due to the increase in network transmission loss 
which accompanies longer call distances as a result of the via net loss 
design.® 

In the second of these analyses, the data were classified by originating 
local switching machine type. No significant relationship was found 
between machine type and near-end signal power. 

Three demographic factors were considered in the third analysis. The 
first factor, geographical location, does not play an important role in 
determining speech signal power. While the average near-end signal 
power is highest in the northeast section of the country and lowest in the 
southwest, the range of the differences is only 2.7 dB, and the correlation 
between loop current and signal power accounts for about 40 percent 
of the difference between geographic areas. The second factor, city or 
town population, tends to mask rather than uncover relationships be- 
tween signal power and population. A more appropriate measure is the 
population density of the exchange served by the local telephone office. 
The third demographic factor, locality type, was defined to capture the 
impact of population density upon speech signal power. Five locality 
types were considered: downtown areas of large and midsize cities, 
downtown areas of small towns, outer-urban areas, and suburban areas. 
Large cities were defined as cities with populations of 100,000 or more 
people; mid-size cities were defined as cities with populations ranging 
from 20,000 to 100,000 people; and small towns were defined as cities or 
towns with populations of 20,000 or less people. The outer-urban clas- 
sification denotes areas with a mixture of residential dwellings and 
business establishments on the outlying fringes of large cities, and the 
suburban classification denotes areas which primarily contain residential 
dwellings. The average near-end EPL and loop current both exhibit the 
same trends with locality types. Both are highest for downtown MDFs 
in large cities and lowest for outer-urban areas. These results correlate 
with the fact that in the first case the population of customers is rather 
concentrated, and they tend to have relatively short loops, while in the 
second case the population of customers is rather widespread, and they 
tend to have relatively long loops. Between these extremes, the average 
EPL and loop current for small towns are higher than for mid-size cities, 
and both have higher averages than suburban areas. As illustrated in 
Figs. 16 and 17, the differences among the categories are not large; 
however, they do suggest a dependence of loop current and, as a result, 
EPL upon varying densities of populations. 
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Fig. 16—Average signal power per MDF by locality type. 


itl. METHODOLOGY 
3.1 Statistical survey sample plan 


The Loop Signal Power Survey sampling plan consists of three major 
components—a precise definition of the target population and param- 
eters, a scheme for the selection and measurement of a sample of calls, 
and the choice of the estimation formulas. Section 3.1.1 defines the target 
population and parameters, Section 3.1.2 describes the scheme used to 
select and measure a statistical sample of calls, and Section 3.1.3 de- 
scribes the statistical estimation and confidence interval formulas used 
to estimate the target parameters. 


3.1.1 Target population and measured parameter definitions 


The target population consists of voice calls originating over the public 
switched network where the subscriber’s loop is classified as business, 
single party residence, coin semipublic, Private Branch Exchange (PBX), 
or Centralized Exchange (centrex) service. The aggregate of subscriber 
loops in the target population are naturally partitioned according to the 
local MDF in which they terminate. In addition, the subscriber loops 
terminating in an MDF are naturally dichotomized into a customer- 
provided equipment (CPE) substratum and a Bell equipment sub- 
stratum. A loop was identified as belonging to the CPE substratum when 
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Fig. 17—Average loop current per MDF by locality type. 


the local operating company billing records and a follow-up station 
verification identified the connection to the subscriber’s loop of a pro- 
tective connecting arrangement (PCA) listed in Table VI. A PCA is de- 
signed to interconnect non-Bell terminal equipment with the Bell Sys- 
tem public switched network. 

For potential statistical advantage, the MDFs were partitioned into 
12 strata according to the average 1970 population census of the com- 
munities within the plant district where an MDF was located. The 12 
strata were constructed so that they are approximately the same size with 
respect to the total number of business, residence, PBX, centrex, coin 
semipublic, and switched data telephone lines terminating on MDFs 
within the stratum. This form of stratification was suggested by the re- 
sults of the 1960 Speech Volume Survey, which indicated a correlation 
of speech volume with city population. Stratification by city size offered 
the potential for reduction of the variability in speech signal power within 


Table VI—Protective connecting arrangements (PCA) 


Associated Non-Bell 


PCA USOC* Terminal Equipment 
STP Key telephone system 
STC Single line set 
C2ACP Single line or key telephone system 
CD8, CDH PBX or centrex CU 
CDA, CD1, CD7, CD9 Cord switchboard or console 


* USOC—Bell System Universal Service Order Code. 
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each of the strata and, as a result, an increase in the precision of estimates 
of the mean signal power. 

In general, the choice of the criterion for stratification is arbitrary and 
does not affect the validity of the final survey conclusions; however, a 
judicious choice of a stratification scheme can lead to an estimate of the 
mean with a smaller confidence interval than would be obtained other- 
wise. 

Each loop associated with the target population is indexed by its 
stratum number, MDF number within a stratum, substratum number 
(e.g., 1 Bell, 2 CPE) and loop number within an MDF substratum. 

The target population parameters estimated in the Loop Signal Power 
Survey are defined by the ratio 


R= Y/X, 


where 
Nn Dni Mhia Qhiaj 


Y= e >» 3S yz pS Yhiajrs 


h=1i=1a=1 j=1 k=1 
X is defined similarly to Y with Yniajk replaced by Xniajr, 
N», is the number of MDFs located i in class 5 offices in stratum h, 
for We = 1, 2.24225; 
Dy; 1s the fimabet of substrata into which the subscriber loops that 
terminate in the ith MDF of stratum h are partitioned (Dj; = 
2), 
Mhia is the number of subscriber loops that are in substratum a and 
terminate on the ith MDF in stratum h, 
and 
Yniajk ANd Xhiajr, R = 1,2, ..., Qniaj, represent measurements associated 
with the Qnia; completed calls which originate from loop (hiaj). Loop 
(hiaJ) is identified as the jth loop terminating in substratum a of the ith 
MDF in stratum h. 
Some examples of applications of the ratio parameter F are given 
below. 
Application One: Fraction of Calls Where the Mean Transmitted 
Signal Power Exceeds Some Threshold 
Suppose Ypjiajz is defined as 1 if the kth completed call on loop (hiaj) 
is in the target population and the mean signal power exceeds some 
threshold T, and 0 otherwise. Second, suppose Xpjajz is defined as 1 if 
this call is in the target population, and 0 otherwise. R is then equal to 
the fraction of completed calls in the target population for which the 
transmitted mean signal power exceeds T. This form of the ratio pa- 
rameter is applicable to target populations such as completed calls (toll 
and/or local) originating from the Bell and/or CPE subclasses of 
loops. 
Application Two: The Mean Originating Signal Power Per Call 
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Suppose Xpjqjz is defined as in Application One, and Ypiajz, is defined 
as a measure of signal power of the kth completed call originating on loop 
(hiaj), then R is equal to the mean originating signal power per call. 


3.1.2 Survey sampling scheme 


The calls which were measured in the Loop Signal Power Survey were 
statistically selected in such a way as to permit precise estimates of the 
population parameters described in Section 3.1.1 and at the same time 
limit the costs of obtaining the measurements. The actual statistical 
sample selection scheme used was a classical three-stage sampling 
scheme with stratification and substratification. From each of the 12 
strata described in Section 3.1.1, three MDFs were selected with proba- 
bilities of selection proportional to estimates of the total number of 
business, residence, PBX, centrex, and coin semipublic lines terminating 
on each MDF. The locations of the 36 sampled MDFs are illustrated in 
Fig. 18. A stratified random sample of CPE and Bell loops, which ter- 
minated on the 36 MDFs, was selected, specially designed measurement 
equipment was connected to these sampled loops, and signal power 
measurements were made on a sample of calls originating over the loops. 
The selection of the CPE loops was made from a billing records inventory 
of subscriber telephone numbers that were being billed for a PCA with 
one of the Universal Service Order Codes (USOC) listed in Table VI. A 
random sample of Bell loops was obtained by generating a list of random 
four-digit numbers and prefixing a local three-digit NNX code for each 
NNX associated with the MDF. These lists were forwarded to the local 
repair service bureau for determination of the class of service of each 





Fig. 18—Locations of sampled MDFs. 
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telephone number and the location of the loop on the MDF. A stratified 
random sample of CPE and Bell loops, identified as members of the target 
population, was ordered according to the location on the vertical side 
of the MDF. Approximately 1 week prior to the scheduled arrival of the 
Bell Laboratories survey team, a verification was made by local operating 
company craft people to assure that each selected line was working, that 
the telephone number-cable-pair and horizontal frame assignments were 
correct, and that no bridged lines were present. From this verified list, 
a stratified sample of up to 30 CPE loops and at least 69 Bell loops (for 
a total of 99) were selected for connection to the survey equipment. The 
equipment included a device which, when activated, scanned the 99 loops 
for an originating off-hook signal. Following seizure of the loop and the 
establishment of a connection, the measurement process was started 
manually if a conversation ensued. Conversation was detected by util- 
izing an equipment operator’s monitor channel which provided unin- 
telligible speech during periods of conversation through the use of a low 
speech sampling rate. Because toll calls were relatively scarce, provision 
was made for the equipment operator to abort the measurement of local 
calls to obtain additional toll calls. The measurement period in a local 
office was 3 days. 

The survey equipment provided peg count data from which the 
number of originated completed calls was estimated for each loop. These 
data formed the basis for traffic weights used to estimate the target 
population parameters. 


3.1.3 Estimation formulas and confidence intervals 


This section is devoted to a discussion of the statistical estimation 
formulas that are used to estimate the ratio parameter R. These formulas 
are tailored to the survey sample design discussed in Section 3.1.2. The 
form of the estimation formulas require the following information rel- 
ative to the sampling plan: 

n,—the number of sampled MDFs in primary stratum h for h = 1, 2, 

...,L. (ny, = 3 forh =1,2,... 12). 

2pi—the probability of selection into the first stage sample of the :th 

sampled MDF in stratum h fori = 1, 2,...,n, andh =1,2,...,L. 

Mniqa—the number of measured subscriber loops that belong to the 

ath substratum of the ith sampled MDF in stratum h fori = 1,2,... 

Nh, a= d., Zz, coe Dhyj, and h = 1,2, eee oe 

Ghiaj —the number of calls associated with loop (hiaj) on which signal 

power measurements were made. 

L, Mnia, Dni and Qniaj are defined as in Section 3.1.1, and 

(Xhiajk» Yhiajr), R = 1, 2, ..., Uhiaj represents a sample of gniaj values 

of (Xhiajr, Yniajn), R = 1,2, ... Qniaj, where 

Xhiajk ANd Ypiaj, are defined as in the definition of R. 
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A three-stage estimator of the ratio R = Y/X where Y and X are de- 
fined as in Section 3.1.1 is 


r=y/x, 
where 
L_snth mhia Ghiaj 
y= Dd WhiajYhiajk; 
h=1i=1 j=1 k=1 
— 1 1 Mnia Qhiaj 
Whiaj = —__ 


Nh Zhi Mhia Chiaj 
and x is defined similarly to y with yniajp replaced by xXpiajr. 
The mean squared error of r is defined as 
VAR(r) = E(r — R)?, 


where E(-) denotes expected value. 
A consistent estimator of VAR(r) is 


2A 1 Mh TYni—TXni 1 Ph Yai — TXni |? 
pO) = Ss Se 
x*n=1NpA(Np-1) i=1 a Nhi=1 Zhi 


Dri Mhia Ghiai Mnig Qhiaj 
a = hak 
a=1 j=1 k=1 Mhia Qhiaj 
and xp; is defined similarly to yp; with yniajr replaced by xpiajr. 

An application of the Central Limit Theorem yields an approximate 
90-percent confidence interval for R as the interval 


(r — 1.645V v(r), r+ 1.645Vv(r)). 


Yhi = 


3.2 Data acquisition plan 


In this section, requirements pertaining to acquisition equipment 
capacity, compatibility, transparency, privacy, etc., are summarized, 
and a block diagram of the Loop Signal Power Survey acquisition 
equipment is discussed. 


3.2.1 Requirements 


As indicated in Section 3.1, the sample plan called for access to 99 
customer loops in each of 36 class 5 offices and measurements of near- 
and far-end signal power on live calls. Determination of call destination 
required the detection of call originations on loop start and ground start 
lines, and the detection of dial pulse and TOUCH-TONE® address infor- 
mation. Because of the loop-to-loop and call-to-call variability in im- 
pedance at the MDF interface, the measurement of real power was re- 
quired rather than bridged voltage. In the course of accessing and 
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measuring calls, no detectable impairment (loss or switching clicks) was 
to be added to the connection. Monitoring of intelligible speech was 
prohibited by privacy considerations. Speech signals are predominantly 
half-duplex in nature; however, both parties sometimes talked at the 
same time. Because the point of measurement was a two-wire point, it 
was necessary to devise a method to sort the speech signal data into two 
categories, near-end and far-end. 


3.2.2 Data acquisition equipment 


Figure 19 is a block diagram of the equipment used to acquire speech 
signal power data. The 99 customer loops were accessed at the protector 
socket of the MDF. Access cables connected the customers’ loops to the 
acquisition console protector panel. This panel provided series access 
to 99 loops, circuit protection, and an electrical interface with the in- 
strumentation switch. This interface contained current sensing resistors 
for the detection of metallic speech current and loop dc current. Modified 
service observing equipment was bridged across the tip-ring interface 
at this point to allow the detection of outgoing call seizures and the de- 
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Fig. 19—Loop signal power survey data acquisition console. 
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tection of dial pulse/TOUCH-TONE address digits. The instrumentation 
switch connected the four leads associated with the current sensing re- 
sistors of one of the 99 loops to the analog signal processing equipment 
for the detection, amplification, and filtering of the metallic speech 
voltage and current. 

The resulting voltage and current signals were simultaneously sampled 
at the rate of 200 samples per second using dual 12-bit A-D converters. 
The sampled data were stored in a buffer memory, combined with label 
information, and written in 16-kb blocks on a minirecorder magnetic 
tape unit. A paper-tape printer recorded off-hook event times for each 
of the 99 loops so that traffic weights referred to in Section 3.1 could be 
determined. In addition, the dialed area and office code were recorded 
on the tape. The digitally recorded speech signal data were subsequently 
analyzed in a manner described in the next section. 

The loss due to the current sensing resistors and bridged equipment 
was negligible. This, combined with click supression circuitry, made the 
measurement equipment transparent from the customer’s point of view. 
The low rate of sampling made the recorded speech signal unintelligible 
but allowed the recovery of pertinent signal power information. A low 
speech sampling rate was also used to make the equipment operator’s 
monitor channel unintelligible, yet permit the identification of call 
progress signals. The acquisition of simultaneous speech voltage and 
current samples permitted the discrimination of the near-end from the 
far-end talker in a manner discussed in the next section. 


3.3 Analysis of data 


This section explains how voltage and current samples were processed 
to obtain measures of speech signal power for each talker in the two-way 
conversations. 


3.3.1 Raw speech signal power data processing 


The raw data upon which speech signal equivalent peak level (EPL) 
and average power estimates are based consisted of metallic speech 
voltage and current samples. The metallic speech voltage and current 
on the loop were amplified and filtered to exclude signals higher than 
4 KHz and remove the effect of 60 Hz, its first two odd harmonics, and 
low frequency noise below 100 Hz. The resultant voltage and current 
analog signals were then simultaneously sampled at the rate of 200 
samples per second using two 12-bit linear A-D converters. The digital 
sampled data were then recorded on tape cartridges, which were later 
reformatted onto standard computer tape. 

The first step in computer processing of the digitally recorded signals 
consisted of removal of de bias produced in the analog signal processing 
filters and computation of the instantaneous power (watts) associated 
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with each voltage-current sample pair. The equipment was designed so 
that the power values were positive (voltage and current in phase) when 
the signal source was the near-end talker and and negative (voltage and 
current out of phase) when the signal source was the far-end talker. 


3.3.2 Discrimination of near-end and far-end talkers 


Conversational speech is predominantly half-duplex, but brief periods 
occur when both talkers are active at the same time. The stream of in- 
stantaneous power samples is therefore positive or negative for half- 
duplex talk-spurts. However, during double talking, the sign of the power 
samples may change rapidly and the magnitudes of the power samples 
become useless for estimation of near-end or far-end talker power. To 
properly sort the power sample stream into two distinct “bins” corre- 
sponding to the near-end and far-end talkers, empirical algorithms were 
developed in laboratory simulations, and one algorithm (SGN algorithm) 
was chosen for use during the speech signal processing phase of the 
survey. 

The SGN algorithm uses the sign and magnitude of the power in short 
subsequences of the stream of speech power samples to generate two 
sequences of speech power samples corresponding to near-end and far- 
end talkers. 

Let {p} be the sequence of instantaneous speech signal power values 
computed from the relationship: p = v-i, where {v} and {i} are sequences 
of instantaneous, simultaneous samples of speech signal metallic voltage 
and current, respectively. 

Let the sequence {p} be divided into consecutive subsequences of 
length /. Associated with the ith subsequence is the average power: 


— 1 il 
Bi=7 2 
k=il—l+1 
—lifp;>0 
Let SGN(p;) = Oif pj = 0 
+1 if pj < 0. 


The SGN algorithm depends on two conditions for every subsequence: 

Condition 1: SGN(p;) = SGN(p;-1) 

Condition 2: |p;| = a|p;-1]. 

If either condition is true, then SGN (p;) determines the sources of the 
speech signal for the ith subsequence. As stated earlier, the sign con- 
vention is such that a positive value indicates that the near-end talker 
is the source (far-end samples set to 0), and a negative value indicates 
that the far-end talker is the source (near-end samples set to 0). After 
the source is determined, the nonzero power samples are set positive and 
placed in the appropriate (near- or far-end) sequence. 


SPEECH SIGNAL POWER 2723 


If neither of the above conditions is true, then the direction is inde- 
terminant and all power samples in the :th subsequence are set to 0. 
Laboratory investigations established that the values | = 2 and a = 10 
give good performance with the sample rate used in the survey (200 
samples per second). The output from the SGN algorithm consists of two 
sequences of positive instantaneous signal power samples representing 
the near-end and far-end talkers. 


3.3.3 Measures of speech signal power 


Two measures of speech signal power are developed from each of the 
near-end and far-end sequences described above. The first measure is 
the average speech signal power defined over the observation interval 
(generally about a minute) as follows: 


1 7 
Near-end average power = 30+ 10 log— >~ p, near-end (dBm) 
Nk=1 


1 on 
Far-end average power = 30 + 10 log— >> p» far-end (dBm), 
Nk=1 


where py-end represents the elements in the sequence of instantaneous 
power samples for the direction of interest, and n is the total length of 
the power sample sequence. 

The second measure used to characterize speech signal power is an 
estimate of the peak power in the distribution of samples of talker signal 
power. The estimator is the empirical equivalent peak level (EPL), de- 
veloped by Brady. A complete discussion of the EPL and its properties 
is given by Brady in Ref. 2. The EPL is developed from the power sample 
sequence for the direction of interest as follows. 

Let the instantaneous power of the kth sample be defined as: 


Pk = Uplp watts. 
In logarithmic units, 
Pr = 10 log pz (dBw). 
Define a threshold ¢ and multiplier 6, so that: 
1 ifp,>@? 


é, = . 
‘ 0 otherwise 


The average power over threshold is defined: 


n 
D~ Pro 
k=1 


n 
~ oR 
pat 


Peo = 10 log 
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Now define D = py — ¢ dB. From D compute A using the following em- 
pirical rule: 


D < 6.75, then A = (D — 2.75)/0.4 
6.75 < D < 13.5, then A = D/0.675 
13.5 < D, then A = (D + 2.88)/0.819. 
From A compute EPL as: 
EPL=A+4¢. 


Some important properties of the EPL are that it is independent of 
the talker’s activity since it is not affected by the silent periods in the 
conversation, and its estimate varies little over a wide range of threshold 
values. Some laboratory investigations indicate that a threshold of 10 
to 20 dB below EPL gives good performance in the presence of noise; a 
threshold of 20 dB below EPL was selected as giving the best noise re- 
jection without discarding an excessive number of samples. The EPL 
computation was iterated until the threshold was 20 + 3 dB below the 
EPL value. 


IV. COMPARISON WITH PREVIOUS DATA 


In 1960, measurements of talker volume were made on live traffic using 
VU meters.! These measurements of talker volume are compared with 
the current survey results, which have been translated from EPL to VU 
using an empirical correction factor. These results are listed in Table 
VII together with the 1960 survey results. 

The 1960 survey results differ substantially from the current results 
in that the toll volumes were substantially higher in 1960 and the ranges 
of volumes within the various call destination categories were substan- 
tially greater. There have been some substantial changes in the telephone 
plant since 1960 that may help to explain these differences. The pro- 
portion of toll grade battery has decreased substantially, resulting in a 
decrease in toll call speech volume. Loss plan improvements, the phasing 
out of the 300-type telephone set, and the growth of direct trunking have 
all tended to increase the uniformity of service in the network and make 
it more transparent to customers. The apparent result is a network with 
remarkable uniformity of speech signal power. 


Table Vil—Comparison with 1960 speech volume survey 


1960 1975-1976 
Call Destination Average VU Std. Dev. Average VU Std. Dev. 
Intra-building —24.8 Ta =—22:2 4.6 
Inter-building —23.1 7.3 —22.5 4,7 
Toll —16.8 6.4 —21.6 4.5 
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An Adaptive PCM System Designed for Noisy 
Channels and Digital Implementations * 


By DEBASIS MITRA and B. GOTZ 
(Manuscript received November 29, 1977) 


We propose a new adaptive quantization scheme for digitally 
implementing PCM and DPCM structures. The arithmetics we develop 
for the digital processing are useful as well in the implementation of 
previously existing schemes for adaptive quantization. Two objectives 
are stressed here: (i) The system must be robust in the presence of noise 
in the transmission channel which causes the synchronization between 
quantizer adaptations in the transmitter and receiver to deteriorate. 
(wit) It must also minimize the complexity of the digital realization. In 
addition to the above objectives, we require, of course, good fidelity of 
the processed speech waveform. The problem of synchronization in 
digital implementations where the constraint of finite precision 
arithmetic exists has not been addressed previously. We begin by ex- 
amining an existing, idealized adaptation algorithm which contains 
a leakage parameter for the purpose of deriving robustness. We prove 
that, to provide the necessary synchronization capability without 
impairing the quality of speech reproduction, it is necessary to use a 
minimum, unexpectedly large, number of bits in the machine words 
and, additionally, to carefully specify the internal arithmetic, as is done 
here. 

The new scheme that we propose here uses an order of magnitude 
less memory in an ROM-based implementation. The key innovations 
responsible for the improvement are: (i) modification of the adaptation 
algorithm to one where leakage ts interleaved infrequently but at reg- 
ular intervals into the adaptation recursion; (ii) a specification of the 
internal machine arithmetic that guarantees synchronization in the 
presence of channel errors. A detailed theoretical analysis of the sta- 
tistical behavior of the proposed system for random inputs is given here. 
Results of a simulation of a realistic 16-level adaptive quantizer are 
reported. 


* A short version of this paper was presented at the International Conference on Com- 
munications, Toronto, June 1978. 
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l. INTRODUCTION 


We propose a new scheme for adaptive quantization which is partic- 
ularly well suited to the digital implementation of PCM and DPCM 
‘structures. In the course of this work, we have developed arithmetics for 
the digital processing that are useful as well in the implementation of 
previously existing schemes for robust quantization. 

The exacting requirements on adaptive quantization stemming from 
the broad dynamic range and rapid transient behavior of speech are well 
known. Two additional objectives are given equal importance here: (7) 
To make the system robust in the presence of channel errors. Thus, while 
channel errors may cause the quantizer adaptations in transmitter and 
receiver to be put out of synchronization,* a mechanism must exist which 
acts to rapidly restore the synchronization during periods of error-free 
transmission. (ii) To minimize the complexity of the digital realization; 
specifically, to minimize the length of the internal words in the digital 
processors and to facilitate the multiplexing of the hardware. 

Systems do exist in the literature for robust quantization in the 
presence of noisy channels; one such system is described below in some 
detail. However, the problem of synchronizing the quantizer adaptations 
in the transmitter and receiver in digital implementations, where the 
constraint of finite precision arithmetic exists, has not been addressed 
previously. We prove that, to provide the necessary synchronization 
capability without impairing the quality of speech reproduction, it is 
necessary to use an unexpectedly large number of bits in the internal 
words of the digital processors at both sites and, additionally, to carefully 
specify the internal arithmetic (which we do). If the digital processing 
is implemented using ROMs, as is being proposed, the long internal word 
length is reflected in large memory requirements and therefore costly 
implementations as well as exposure to new errors in the processing. 

The scheme that we propose here uses an order-of-magnitude less 
memory in an ROM-based implementation in both the transmitter and 
receiver. This is for comparable performance with respect to loading 
characteristic, signal-to-noise ratio, and the synchronization capability. 
Another advantage not reflected in the above estimate is the fact that 
the essential costly digital component, the ROM, as distinct from other 
less costly components such as adders, is used only for a small fraction 
of the total operating time. Thus, further economies may be effected 
through multiplexing the ROM. The key innovations are: (i) the modi- 
fication of the adaptation algorithm which allows the internal word 
length of the digital processors to be reduced significantly; and (ii) a 
specification of the internal arithmetic that guarantees synchronization 
in the presence of channel errors. As mentioned previously, the arith- 
metic is also applicable in digital implementations of previously existing 
adaptation algorithms. 


* In our us4ge, synchronization is synonymous with tracking. 
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A byproduct of the work reported here is that it establishes a link 
between two hitherto unconnected areas, namely, finite-arithmetic 
digital signal processing and waveform quantization in the presence of 
a noisy channel. The problem of synchronizing two geographically sep- 
arated digital processors gives rise to quite novel requirements on the 
processing, and we expect that the problem will be a subject of further 
investigation in the future. 

The paper is organized as follows. In Section 1.1 we describe an existing 
quantizer adaptation scheme and the associated synchronization 
problem. Section II is devoted to the basic description of the new scheme. 
Section 2.1 introduces the key idea underlying the scheme. Section 2.2 
considers the digital implementation of the system, and Section 2.3 
considers the synchronization behavior of the resulting system. Section 
III is devoted to the probabilistic analysis of the behavior of the proposed 
algorithm. The basic notions of the bias functions, central log step sizes, 
and load curves are introduced, and the qualitative results proved in their 
connection are stated. In Section IV, some computational results are 
presented in the context of a realistic 16-level quantizer that has been 
proposed and investigated previously in connection with an industrial 
application. We try to illuminate the topics considered in Sections II and 
III through examples involving this particular quantizer. Four appen- 
dices to the paper present the detailed technical derivations. 

On account of the length of the paper, we considered it desirable to 
include a final section, Section V, which summarizes and puts into per- 
spective the key results obtained in the preceding sections. 

We should mention that the digital implementation of adaptive DPCM 
systems is under investigation within Bell Laboratories in connection 
with TASI-D, subband voice coding, and new channel banks. The work 
reported here is a research study and not a description of a developed 
design. 


1.1 Background and description of the problem 


We begin by describing a system proposed in Ref. 1 which, unlike 
earlier systems upon which it is based,?-> possesses the capacity to re- 
cover from past channel errors during periods of error-free transmis- 
sion. 


1.1.1 An existing idealized scheme for robust quantization 
Let A(z) (see Fig. 1) denote the step size of a quantizer, with 2N levels, 
at the ith sampling instant; A(z) is adapted according to the rule 
A(Gi + 1) = A(t)8&M (2), i = 0,1,2,--- (1) 
where 6, 0 < 8 <1, is the leakage constant and M(i) is the multiplier at 
time i. M(t) is selected from a prespecified collection of multipliers 
{M,,Mo, --+,My} according to the rule: 
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EyA 





Fig. 1—The quantizer. A natural coding scheme is displayed. The step size is time- 
varying and the parameters (é,,) and (7,) are prespecified and fixed. 


If £,-1A(i) S |x(i)| < &-A@), then M(i) = M,, (2) 


_where x(i) is the input signal variable (speech or data) at time i and 
O = £0,£1,-++,én-1,EN = © are fixed, ordered parameters of the quan- 
tizer,* Fig. 1. The multipliers are also ordered, i.e., 


M, <= Mo S---< My. 

It is widely recognized®’ that (1) is not in a form convenient for im- 
plementation, even analog implementation. To utilize conventional 
multipliers, it is necessary to work with the log-transformed version of 
(1). 

Denote the log step size by d(i), where 


d(i) * loggA(i), (3) 
Q being a fixed number greater than 1, and the log multipliers by 
. m(i) = loggM(i), m, = loggM,, 1 sr <N. (4) 
Also let 
£, * loggé,, 1<r<N. (5) 
Thus, from (1) and (2), 
d(i+ 1) = Bd(t) + m(i), 1=0,1,--- (6a) 
where 


* When the parameters {é,} and {7,} are spaced equal distances apart, the quantizer is 
usually referred to as a uniform quantizer and it is natural to call A the “step size.” How- 
ever, for nonuniform quantizers, the term “step size” is less natural and other candidates 
are “scale” and “‘range.”” However, since there is no reason for confusion, we retain the 
familiar term “step size.” 
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m(i) = m, iff £,-1 + d(i) < logg|x(i)| < & + d(i). (6b) 


The only information that is coded and transmitted at time ! is that 
concerning the quantizer output which uniquely determines the selected 
log multiplier m(z). A natural coding scheme is exhibited in Fig. 1. The 
recursion in (6) is implemented at both the transmitter and receiver. We 
let m’(i) denote the log multiplier corresponding to the received code 
word at time 1, and we employ the natural notation d’(i) to denote the 
log step size in the receiver. The reconstruction, R(i), at the receiver of 
the input signal variable is done according to the rule: 

If m’(i) = m, then |R(i)] = 7-Q%©, (7) 
where 7,, 1 < r < N, are also prespecified, fixed parameters of the 
quantizer, as shown in Fig. 1. The sign of the reconstructed value is ob- 
tained from the sign bit, usually the first and shown as such in Fig. 1, in 
the received code word. 

The synchronization capability of the system, i.e., the capability 
possessed by the solutions of the recursions, {d(-)} and {d’(-)}, at the 
transmitter and receiver to approach each other during error-free 
transmission is entirely due to the presence of the leakage parameter £. 
For if d(0) and d’(0) are two, possibly different, initial values of the 
log-step sizes at the commencement of an epoch of error-free transmis- 
sion, then during the epoch | | 

|d(i) — d’(i)| = B'|d(0) — d’(0)], i=0. (8) 
The notion of introducing leakage as a mechanism for deriving robust- 
ness in the presence of a noisy channel is a well-known one in commu- 
nication practice; witness, the leaky delta-modulator.® 

As far as the synchronization of the transmitter and. receiver adap- 
tations is concerned, eq. (8) implies that decreasing 8 provides improved 
quality. However, there is an accompanying price. The data in Fig. 5 of 
Ref. 1 together with the theory developed here in Sections 3.2 and 3.3 
on the load curves (which describe the statistical behavior of the step 
size for random inputs) show that the statistical dynamic range of the 
step size is reduced rapidly with decreasing 8, with a concomitant de- 
terioration of the quality of the reconstruction.* Recent subjective tests!° 
have shown that it is very unlikely that 6 less than ®34 can provide ac- 
ceptable quality speech reproduction. 

Herein lies the gist of the problem: For good quality reproduction, the 
leakage parameter must necessarily be very close to 1, and this, on the 
other hand, makes it difficult to provide good quality synchronization. 
It is thus necessary to walk a narrow path between too small leakage and 
too large leakage. As we see next, the constraint of finite precision 


* Numerous related topics are treated analytically in Ref. 9. 
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arithmetic imposed by a digital implementation compounds the design 
problem. 


1.1.2 Digital implementations 


Equation (6) assumes continuous values of d(-) and infinite precision 
arithmetical operations, and hence it can only serve as an ideal in a digital 
implementation. An all-digital coder will have only a limited dictionary 
or total number (typically, =32, <128) of possible log step sizes. We will 
consider the log step sizes to be integers varying from 0 to 2% — 1; thus, 
typically, 5 < K <7. Itis necessary to introduce the notion of an internal 
machine word with K integer bits and, say, F fractional bits (the need 
for fractional bits will become apparent shortly); the log step size is ob- 
tained from the internal machine word at time 1, y(t), by means of an 
external arithmetic, such as truncation. Although later we will consider 
other possibilities, for the purpose of this discussion let us assume that 
the external word at time 1, which is the log-step size at that time, is 
simply the integer part of the internal word at time J, 1.e., 


d(i) = [y(7) ]truncates l = 0,1,2 Sn te (9) 
The machine implementation of the ideal recursion in (6) is 


where (Gy(t)) denotes some procedure, such as rounding, for taking By(v) 
into a (K + F)-bit word. It will turn out later that this operation is best 
viewed with greater generality as a mapping f of (K + F)-bit words, with 
F fractional bits into other such words. Thus we restate (10) as* 


yit=fly@j+m@, 1=0,1,2,---. (10’) 


It will be assumed that all the log multipliers {m,} have at most F frac- 
tional bits each, which ensures that if y(i) isa (K + F)-bit word then so 
is y(u + 1). 

Figure 2 shows an example of the most direct procedure for generating 
the discrete map f(y), namely, by rounding @y to the nearest machine 
word. In the example, considered F = 1 so that the spacing between 
machine words is 2~¥ = ¥. A feature common to such maps is that seg- 
ments of unit slope are juxtaposed between other segments of zero slope 
which we call “breaks.” 

If, as before, we distinguish the quantities associated with the receiver 
by the superscript ’, we see that the offset in the machine words behaves 


* In (10) and (10’) we have not made allowances for overflow. This however can be done. 
conventionally by employing saturation where: 
yt 1) = Oif (By(i)) + m(v)<0, 
= 2K — 2-F if (By(i)) + m(i) > 2K — Q-F, 
and in every other case (10) holds. Saturation acts to attenuate the offset in the machine 
words at the two sites. 


2732 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1978 


as follows during epochs of error-free transmission |[i.e., periods in which 
m(-) = m’(-)}: 

IyG+ 1) —y’GF DI = [fo O} — Ay’) (11) 
[compare with (8)]. 

The synchronization problem motivates us to impose the following 

two rather stringent requirements on the behavior of the offset. 

Synchronization requirements: 

(it) The offset is nonincreasing at all instants of error-free trans- 
mission. 

(iz) The integer parts of the machine words at the two sites, and hence 
the respective log step sizes, differ in at most a finite (preferably 
small) number of time instants during error-free transmission. 

We require the above to hold independent of the statistics of the input 
process. It is clear from (11) that these requirements imply restrictions 
on the discrete map f which are investigated below. 

Let us digress to better motivate the second of the above requirements. 

If the integer parts of the machine words at the two sites at any instant 
are not identical, then the respective log step sizes differ by at least unity 
and, hence, the ratio of the two step sizes is at least Q [see eq. (3)]; this 
factor may be unacceptably large since values of Q as high as 1.5 are being 


B=7/8 

FRACTIONAL BITS IN BINARY REPRESENTATION 
OFB,L=3[B=1-2-4 

FRACTIONAL BITS IN MACHINE WORD, F = 1 
RULE FOR GENERATING f {y): 

ROUND fy TO NEAREST MACHINE WORD 


f(y) 





LOCATION OF 
“BREAKS” — — = 


Fig. 2—An example of a naive machine arithmetic. 
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considered in practical designs.* To illustrate another facet of the second 
requirement, consider the case where, at a particular instant, the 
transmitter and receiver machine words are rather close, say, 1.9375 and 
2.0625 (F = 4). Yet the integer parts are 1 and 2, respectively. Thus the 
step sizes are Q and Q2, rather far apart. This example serves to illustrate 
that the mere proximity of the two machine words is not enough to 
guarantee that the log step sizes are identical. 

In the following discussion, we will need to know the value of L, an 
integer, which is such that 


1-2-L+1<B<1-2-4; 


if 8 = 7%, as in Fig. 2, then L = 3 and if 8 = ©, then L = 6. To simplify 
the following discussion, we shall assume that 


ane ae (12) 


i.e. Be {Yo, 34, 7/g, -- +}; with this form for 8, L is the minimum number of 
fractional bits required for the binary representation of 8. The as- 
sumption on the form of @ is unessential, and later in Section 2.2 we in- 
dicate that no difficulties are presented if 8 is not of the assumed 
form. 

We give two different but connected reasons which separately lead 
to the rather consequential conclusion that F = L if the resulting system 
is to have certain essential properties, including the synchronization 
capability. The first reason stems directly from the synchronization 
requirements. We show that the latter requires the map f to incorporate 
certain contraction properties which in turn can be possible only if the 
internal machine word has at least L fractional bits. The second related 
reason is that fewer than L fractional bits gives rise to rounding errors 
in each iteration of the recursion which makes it hard to predict the ef- 
fective value of the leakage parameter. Recall from Section 1.1.1 the 
stringent requirements on the leakage parameter. 

Below we amplify both the above arguments. This discussion will 
motivate a more exact treatment in Section 2.2, which will also Provide 
answers to the questions raised here. 

Consider (10’) in conjunction with the synchronization requirements 
(v) and (i). For the first of the synchronization requirements to be sat- 
isfied, it is apparent that it is necessary and sufficient that 


fy) —fO)| Sly —y’| (13) 


for all machine words y and y’. We refer to the above property of the map 
f as the weak contraction everywhere property. The map f shown in Fig.. 


* This is the case if K is 5 or 6. If K is larger, then it is possible to relax the second re- 
quirement by requiring that the offset in the integer parts of the machine words be reduced 
to asmall number (instead of requiring them to be identical). Thus it is possible to trade 
a higher K for a lower F while keeping K + F fixed. In any case, only minor modifications 
to the framework that is developed here will allow such cases to be handled. 
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2 possesses this property by virtue of the fact that the slope of the graph 
of f is everywhere either 0 (at the breaks) or 1. 

For the second of the synchronization requirements to be satisfied, 
we claim that it is necessary and sufficient that the map f have the fol- 
lowing property: 

If y and y’ are any machine words with different integer parts, then 


f(y) — fiy’)| < 6]y — y’| for some 6 < 1. (14) 


We call the above the strong contraction across integer boundaries 
property. Sufficiency is clear, since we have that during epochs where 
the machine words do not have identical integer parts and error-free 
transmission exists, 


ly@) — y’(@)| S 46']y(0) — y’(0)]. (15) 
Conversely, if (14) is not true, then it is easy to construct examples where 
the integer parts of the two machine words are different at an unbounded 
number of time instants. Referring to Fig. 2 we see that the graph of f 
does not possess the strong contraction property (14). To illustrate, 
suppose that initially the two machine words have different integer parts 
and that both words occur in the range [2.5,6]; we see from the figure that 
no mechanism exists to prevent the two words from indefinitely re- 
maining in this range and simultaneously having different integer 
parts. 

We will now argue that the above two contraction properties, together 
with any weak fidelity criterion relating f(y) to Gy, implies that F = L. 
Observe that the strong contraction property, (14), requires a “break” 
(see “breaks” in Fig. 2) in the graph of f(y) just prior to’every integral 
value of y. Reason: y = k — 2-¥ and y =k, k integral, have different in- 
teger parts. Further, if the local slope of the graph of f(y) is not zero, then 
by virtue of the weak contraction property it is either 1 or —1. Finally, 
if F fractional bits are used, then each unit interval of y is composed of 
2¥ intervals of equal length corresponding to that many distinct machine 
words. These three considerations show that the 
2F—] 

OF 





average slope of the graph of f(-) < = 12%. (16) 
But f(y) is supposed to approximate Gy, 6 = 1 — 2-£. Thus, just about 
any weak fidelity criterion will give that the smallest value of F', which 
allows the map f to have the properties required of it, is L. 

Our second reason is closely related to the aforementioned fidelity 
criterion. Implicit in a choice of a leakage parameter (6 with a large 
number of fractional bits, L, in its binary representation (e.g., 3 = ®¥64) 
is the requirement that the absolute rounding error in each iteration of 
(10’), |fly(z)} — By @)], be not larger (at least not by much) than an error 
in the least significant bit of 8, i.e. 2~4: 
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\f(y) — By| <2-£, for all machine words y. (17) 


Otherwise, there is no a priori need to specify 6 to that degree of preci- 
sion. (Our experience with the idealized system, discussed previously, 
shows that it is indeed necessary to specify 6 to a high degree of preci- 
sion.) A little thought will convince the reader that for such a bound, (17), 
on the rounding error to be valid it is necessary that the internal machine 
word have at least L fractional bits. 

In Section 2.2 we show that it is possible to obtain maps f with the 
weak and strong contraction properties that satisfy the fidelity criterion 
with the minimum possible number of fractional bits, i.e., F = L. We 
show that, in fact, the maps obtained are unique. The results will show 
that, for our maps, the offset in machine words during error-free trans- 
mission decreases exponentially fast to a value less than unity, after 
which there may be at most (24 — 1) occasions at which the integer parts 
differ. 

Let us now consider in broad terms what the preceding results imply 
in terms of the cost and complexity of the digital implementation of the 
scheme for adaptive quantization discussed in Section 1.1.1. Consider 
the fairly typical case where the total number of integral log step sizes 
is 64 and 6 = 834, i.e. K = 6 and L = 6. We now know that the total word 
length should be at least 12 bits. Consider the implications on the asso- 
ciated ROM size. The table stored in the ROM will have 212 addresses, each 
address containing 12 bits, giving a total memory size in the transmitter 
and receiver of about 50K bits each! Moreover, with each additional bit 
in the internal word, the memory requirement more than doubles.* 

In the next section, we propose a new adaptation algorithm and specify 
the required arithmetic. The new algorithm requires significantly fewer 
fractional bits in the machine words while possessing the necessary 
synchronization capability. 


ll THE PROPOSED SYSTEM 
2.1 Idealized description 
We propose the following interleaved-leakage algorithm (ILA) as the 
basis for the machine adaptation of the log step size. For fixed parame- 
ters I and y,] = 2 and 0 < yy <1 [see eq. (6)]: 
d(i + 1) = yd(t) + m(z) 
diit+2)=di+1)+m(+t+1) i=0,J,22,--+. (18) 
dit+lI)=ditI—1)+mi+I-—-1) 


* We have considered the possibility of exploiting the idea due to Croisier et al. (Ref. 11) 
and Peled and Liu (Ref. 12) wherein the ROM size may be reduced at the cost of increased 
processing time. The processing times available and the relative costs do not make this 
approach particularly promising at the present time. However, it is an approach worth 
keeping in mind. 
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Here y is the leakage constant, and leakage is introduced only once in 
every J iterations. Thus we refer to J as the interleaving interval. The 
m/(-) terms are the log multipliers, m/(-) € {m1, «++ ,my}, and the selection 
rule is as in (6b). However, in general, the optimum values of the mul- 
tipliers may be different from the ones in the scheme described in Section 
1.1.1 (we refer to the latter scheme as the uniform-leakage algorithm, 
or sometimes only as ULA). 

We observe that for two geographically separated implementations, 
{d(-)} and {d’(-)}, of the recursion in (18) subject to possibly different initial 
values, d(0) and d’(0), but identical {mm (-)} sequences, as is the case during 
error-free transmission, we have for the offset, 


|d(i) — d’(i)| = (y1/)‘|d(0) — d’(0)I, 1=0,7,2/,---. (19) 


Comparing (19) with the similar expression in (8) for the offset in ULA, 
we find that the capability for recovery from channel errors is comparable 
in the two schemes if 


yl = £. (20) 


The above is a key relation. Table I tabulates typical values of 8 and 
the corresponding choices of y and I which give comparable recovery 
capabilities. There are small, inconsequential errors in the table which 
has been obtained from the approximation y = [1 — (1— 6)]7/~1-—J(1 
— 8) for small values of (1 — 8). 

The important point about the table is that, for given 8, the fractional 
bits required for a binary representation of the equivalent value of y is 
reduced by an additional bit for every doubling of the interleaving in- 
terval, J, in ILA. This simple fact is at the heart of the system that is 
proposed. 


Table | — Leakage parameters (6,y) and interleaving intervals (I) 
for comparable synchronization capabilities in the uniform and 
interleaved leakage algorithms* 


7 (ILA) 
B (ULA) r=2 T=4 I=8 I=16 I= 32 
127/08 83/54 3/55 15g 1% 3M 
63/64 3I39 1Sh6 UP Vs 
3Is9 Ih 4g ae 


* We have stopped short of using y = 4 for two reasons. First, there may be no advantage 
in reducing y beyond 34, because two fractional bits may be required in any case on account 
of the specification of the log multipliers, m,. Second, the change in the step size may be 
too drastic, and this may be reflected in the subjective quality. However, it is a possibility 
worth keeping in mind. 
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A slight generalization of the proposed scheme would have the mul- 
tiplier set in the iteration where leakage y is inserted to be different from 
the common multiplier set in all other iterations. This generalization 
provides no gain when the midpoint of the input signal intensities (6 of 
Section IV) is scaled to be unity, which is the case considered in the 
simulations reported in Section IV. Goodman!® has suggested that, when 

-é +1, the log multipliers in the leaky iterations be m(-) + (1 — y) loggé, 
where {m(-)} are the log multipliers in the nonleaking iterations. 


2.2 The digital implementation 


We now consider the digital implementation of the idealized recursion 
(18). 

Here we let L, an integer, be such that 1 — 2-4+1< y <1—2-4. We 
make the simplifying, and inessential, assumption that y = 1 — 2-4;in 
this case, the binary representation of y requires L fractional bits. (Later 
we indicate through an example that it is easy to make the modifications 
which allow other values of y to be used.) Assume K integer and L 
fractional bits for the internal machine words. Thus, following the dis- 
cussion on the synchronization requirements in Section 1.1.2, we are 
assuming that the fractional bits in the machine words are the minimum 
necessary for the system objectives to be satisfied. Finally, assume that 
the log multipliers {m,} are specified to L fractional bits. 

The internal description of the machine is 


y+ 1) = fly} + m@) 
y@ + 2)=ye+ I t+me+ td) i= 0,L21,--+, (21) 


yitD=yitl—-i+mit+I-1) 


where y(-), the internal machine word, is a (K + L)-bit word with L 
fractional bits. In (21), f maps (K + L)-bit words with L fractional bits 
into other such words. The mapping f may be implemented most easily 
using ROMs; the characterization of the map f that we give below is a 
recipe for the programming of the ROMs.* 

The integral log step size d(-) is obtained from the internal word y(-) 
by arule determined by an external arithmetic. We consider two natural 
and simple external arithmetics, rounding and truncation. Thus, 


Rounding: d(*) = [y(-)]round (22a) 
Truncation: d(-) = [y(-)]truncate: (22b) 


We mean that if, for integral k,k —0.5<y <k + 0.5, then [y]-ouna = R; 
ifk <y <k +1 then [yliruncate = R. 


* Observe that the specifications of the maps given here and in Appendix A apply as well 
to the uniform leakage algorithm described in Section 1.1, provided 6 replaces y and the 
ep propnare value of the parameter L associated with the leakage parameter f in ULA is 
substituted. 


2738 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1978 


We consider first the truncating external arithmetic. Following the 
discussion in Section 1.1.2, we impose the following requirements on the 
map f. (It is understood that all arguments of the map have L fractional 
bits.) 


(i) Vo1,09, \f(o1) — f(a2)| S lor — 9): 


“weak contraction everywhere.” (28) 


(ii) oye[k,k + 1) 
oxelk + Lk +2) |f(os) ~ fos)! 


<6<l: 
k integral lo, — o9| 


“strong contraction across integer boundaries.” (24) 


(iit) Vo, f(a) — yo| < 2-4: 
“fidelity of discrete map to continuous map.” (25) 


Recall from Section 1.1.2 that the first two properties are equivalent to 
the synchronization requirements. We also know that these two condi- 
tions together with almost any weak fidelity criterion relating f(c) to yo 
implies that the number of fractional bits in the machine words is at least 
L. We find that we can construct maps f which satisfy in addition the 
fidelity criterion in (iii) without incurring the penalty of using more than 
L fractional bits. Also, as discussed previously, the fidelity criterion in 
(tit) is important in itself. 

In Appendix A we give the complete specification of a map for each 
value of L. In Fig. 3a, we show the graph of the map f for the example of 
y = 34, where L = 2. In Appendix A we also show that there is only one 
such map f for any given L which satisfies conditions (1) to (iii), (23) to 
(25). Further, for this unique map the value of the contraction parameter 
6 in (24) is 2y/(1 + y). 

When the external arithmetic is the rounding arithmetic (22a), the 





LOCATION OF 
“BREAKS” = — — — — — — 


Fig. 3—Machine arithmetics incorporating contraction properties and fidelity criterion 
for (a) truncating and (b) rounding external arithmetics. y = 4, and L = 2 (see Section 
2.2). 
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resulting map f is somewhat different. Appendix A gives the complete 
specifications of the maps for all values of L; these maps are also unique. 
Figure 3b shows the graph of one such map. 

Recall that earlier we made the simplifying assumption that yy = 1 — 
2-“. In general, L is defined to be such that 1 — 2-4+1< y < 1-274, 
Figure 4 illustrates a map f for the case of y = 5 (L = 2) and the trun- 
cating external arithmetic. It may be verified that all the requirements 
in (23) to (25) are satisfied. We may similarly generate maps satisfying 
the requirements for arbitrary rational values of y. 

Note that the maps obtained are rather special and quite distinct from 
the usual maps encountered in digital signal processing. 

Another point to note is that while we have specified arithmetics which 
use the minimum number of fractional bits, F = L, additional fractional 
bits, if they are available, may be put to use by incorporating more than 
one break in the graph of f(c) per unit interval of o. The net effect is to 
give superior synchronization capability. 

Finally, note that the implementation of (21) requires by way of 
hardware only the ROMs, for implementing the map f, and adders. 
However, the ROMs are used only once in every J iterations. This provides 
an ideal opportunity for multiplexing the ROMs between different 
channels and different frequency bands in subband coding!® applica- 
tions. 

2.3 Synchronization in the digital implementation 

We give some bounds on the offset between transmitter and receiver 
during periods of error-free transmission. 

By y and y’, two machine words, having different integer parts we 
mean in the following that [y]round ~ [ylround or Ly Jtruncate # LY’Jtruncate, 
depending on the external arithmetic chosen. Thus, depending upon 
whether the two machine words have identical or different integer parts, 
the corresponding log step sizes are identical or different, respec- 
tively. 


f(a) 


0 
0 1 2 3 4 


LOCATION OF 
“BREAKS” -——- -—— fp ee — 


Fig. 4—Machine arithmetic for y = 5 (L = 2) for two fractional bits in machine word 
and truncating external arithmetic. The contraction requirements and fidelity criterion 
are satisfied. 
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Suppose the machine implementations of the recursions in (18) in the 
transmitter and receiver during error-free transmission are: i = 
0,/,2I,--- 


yi + 1) = fly} + m(@) 
yet 2) syG+1) + met) 
yitl=yitl—-1)+mit+l-—1) 
yi + 1) = fly’) + mi) 
wit 2)=y’Gt1)+mit1) 


yYitH=y”Gtl-1)+mit+I—-1). (26) 
Observe that 
ly@+D-—yG+D|=---= ly@+1)-y’E+ I) 
= |fly@)} - fo’ @}I. 
Now from (23) and (24), 


[fly @)} — Aly’ @)} 
< |y(i) — y’@)| if y@) and y’(i) have identical integer parts, (27) 


< 5|y(@) — y’(i)| if yi) and y’(i) have different integer parts. (28) 


By repeated application of (28) we see that, if |y(0) — y’(0)| > 1, 
then 


lyG) — y’G)| <1 for all j > J log {y(0) — y’(0)|Aog(1/6). (29) 


Thus, once the offset is reduced to less than unity it subsequently re- 
mains thus. 

Now consider the case where |y(0) — y’(0)| < 1. Consider the time 
instants j which are integral multiples of J. There can be at most (24 — 
1) such time instants at which the integer parts differ. This is because 
a reduction of 2~# in the offset is guaranteed by (28) in every such time 
instant. However, at time instants which are not integral multiples of 
I, the convergence of the integer parts is not quite as strong and is a 
penalty (which we believe to be insignificant) of ILA. 


lil, ANALYSIS: PROBABILISTIC ASPECTS 


In this section, we investigate the probabilistic behavior of the log step 
sizes, {d(-)}, when the input signal variables, {x(-)}, are random and 
channel errors are absent. Clearly such an analysis is called for if we are 
to be able to guarantee certain qualitative features of performance that 
are basic and necessary in adaptive PCM systems.*° The key notions of 
the bias function, central log step sizes, and load curves are introduced 
and their qualitative behavior pinned down. 

For our purposes here, the defining equations for the log step sizes are 
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in (18); the selection rule for the multipliers are in (6b). The key as- 
sumption that is made throughout this section is that {x (-)} is a sequence 
of independent, identically distributed random variables with mean zero 
and standard deviation o. We sometimes refer to o as the signal intensity. 
In keeping with the characteristics of speech, we are interested in o in 
the range of. omax/Omin = 100, or even 400 (40 and 52 dB ranges, respec- 
tively). 


3.1 The bias function 
Define the bias function B(-|c) to be 
B(d|c) * Eld(@i+ I)|d(i) = d] —d, i=0,1,27,---. (80) 


A little thought will show that the right-hand side of (30) does not depend 
on i—a consequence of the iid assumption on the input signal variables. 
Different values of o will generally yield different bias functions, which 
explains the notation. In engineering parlance, B(d|c) measures, for 
initial log step size d, the mean drift of the log step size after one cycle 
of updating of the log step size. 

Weare able to show for a wide range of values of o that the bias func- 
tions consistently have a distinctive form, depicted in Fig. 5, of consid- 
erable significance. In particular, we show that B(d| c) is positive when 
d is sufficiently small, and negative when d is sufficiently large. Further, 
under:a rather mild restriction, we can prove the consequential result 
that B(d|c) is monotonic, decreasing with increasing d. The above results 
in their precise forms are proven in Appendix B. The restriction that is 
mentioned above is interesting in itself and, roughly, it calls for a pro- 
pensity for the expected log step sizes after one iteration to be ordered 
in the same way as the initial log step sizes. This turns out to require, 
roughly, that (my — m,) be not too large. 

The importance of the above results is on account of the following 
corollary which we state in qualitative terms: 

If (my — mj) is not too large, then there exists a unique root, or 

zero-crossing, of the bias function B(-|c). 

Without the monotonicity of the bias function, the possibility exists of 


B(d| a) 


c 
\ 
\ 
| 
| 
| 
| 


DIRECTION 
OF DRIFT 


> eee 


Fig. 5—Sketch of a bias function. 
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there being many roots with a consequent dilution of the importance that 
we attach to the root. 

Let c denote such a root for a fixed value of o, Fig. 5: 
Definition of c: 


B(c|c) = 0. (31) 


We refer to c as the central log step size (for signal intensity co). Fora 
different value of o and hence a different bias function, the root will 
generally be different, and to make this dependence quite clear we use 
the notation c(c). 

As the terminology implies, we expect the probability distribution of 
the log step size to have a concentration of mass around c(c) whenever 
the signal intensity is c. The reason for expecting this (see direction of 
drift indicated by arrows at bottom of Fig. 5) is that, whenever the log 
step size is not at c(c), the mean drift of the log step size is toward 
c(o). 

The above conclusion is amply borne out by computational results 
(see Section IV). We find, for instance, that the fit between c(c) and the 
mean log step size in steady state is extremely good for a rather broad 
range of values of o. 

In summary, the dual properties of the central log step size (namely, 
that it predicts so well the mean log step size and that it is so much more 
tractable and easily obtained) explain the emphasis that we place on the 
notion of the central log step size. 


3.1.1 Method for generating the bias function 


The following recursive formula which is developed in Appendix B 
is the most effective method we know for obtaining the bias function. 
First, it is necessary to define the following functionals: 


EQ? . 
b,(r) = 2 p(u)d(u), Leray, (32) 
r—-1Q7 ; 
where p(u) is the common pdf of the input signal variables {x (-)}. (It is 
slightly simpler to make as we do the inconsequential assumption that 
p(-) is symmetrical about 0.) Then B(d|<c) is obtained as the solution of 
the following functional recursion: 


Bo(d|c) = 0, Vd 


+ bbnG Malo ile petal 
B,(d]o) = V™ y (33) 
=(f=y)¢ + x b,(d){Br-1(yd + m,|o) + m,}, k =I. 


Finally, B(d|o) = B;(d|co). 
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The above formula is used in the following manner: Assume that the 
function B,-1(d|o) is known for all values of d. Use (33) to generate next 
the complete function B;(d|o). After I such iterations, the resulting 
function B;(d|c) is in fact B(d|c). 

The reader is referred to eq. (50), Appendix B, for the probabilistic 
interpretations of the ancillary functions B;(-|o). 

The above formula is used in the analysis presented in Appendix B 
to determine the previously mentioned qualitative properties of the bias 
function B(d|c). 

Figure 6 is a plot of the bias function B(d]|1) for a 16-level quantizer 
and normally distributed input signal variables. The interleaving in- 
terval, J, is 16. Observe in the figure that the graph is for d in the range 
[—200,800]. Values of d outside this range are not of much interest, since 
the maximum range of the log step sizes in this example is [Im,/(1 — y), 
Imy/(1 — y)] = [—163,828]. 

3.2 Load curves 

The load curves provide information regarding the manner in which 
the log step sizes depend on the input signal intensity, o. We use the term 
to describe a graph of logge vs. d, where d is the mean log step size in 
steady state for signal intensity o. Naturally, the range of o should cover 
the range of values expected in the specific application. 

From our previous discussion on bias functions and their roots, the 
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Fig. 6—The bias function for uniform 16-level quantizer and normally distributed input 
signal variables, o = 1. Interleaving internal, J = 16 and y = 0.777. The log multipliers are 
given in (39). 


2744 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1978 


central log step sizes, we expect a plot of logge vs. c(c) to be a rather good 
fit to the load curves. 

The utility of the load curve derives from the fact that it may be vi- 
sually compared with a plot of the ideal log step size with respect to o. 
This information may be obtained from solving a variational problem 
as is done by Max,!4 who has also tabulated the solutions for the case of 
normally distributed input signal variables. In any case, the solutions 
to the variational problem for the optimum log step size d(c) have the 
following form 


d(a) = loggo + D, (34) 


where D is a constant which depends on the fixed parameters of the 
quantizer and, importantly, on the common pdf of the input signal 
variables. 

Figure 7 is a plot of the load curve obtained for the 16-level quantiz- 
er. 


_LOAD CURVE: ~ 


d, STEADY STATE _ 
MEAN LOG STEP SIZE 


@ CENTRAL LOG STEP SIZE, c 
(SEC 3.1) 


© APPROXIMATE CENTRAL 
LOG STEP. SIZE, Capp 
(SEC 3.3) 





—40 —30 —20 —10 0 10 20 30 40 
LOGg 


Fig. 7—Load curve (d), central log step size (c), and approximate log step size (Capp) for 
uniform 16-level quantizer and Gaussian, zero-mean, input signal variables of variance 
o2. The log multipliers are given in (39) and Q = 1.1. Interleaving interval, J = 16 and 
leakage, y = 0.777. 
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3.3 The almost-linear dependence of the central log step sizes on signal 
intensity 


Even though a plot of logga vs c(c) may be expected to be a rather 
good approximation to, and certainly simpler to obtain than, the load 
curve (logge vs d), it is an unfortunate fact that it is not a very simple 
matter to obtain c(c). However, our graphs of c(a) have consistently 
displayed a most remarkable trait, namely, the almost-linearity of c(c) 
with respect to o. Intrigued by this feature, we found in an earlier study? 
that it could be explained if the following rather unusual approximation 
is effective: 


y 
f, p(u)du a logy + a9, (35) 


where q, and ag are constants and p(-) is the common pdf of the input 
signal variables scaled to have unit variance. 

Certainly, the above cannot be a good approximation when either y 
is very small or y is very large. But, as we see in Appendix C, we need the 
above to be a good approximation only for a limited range of y; specifi- 
cally, the range of y is required to include the range encountered by 
£,Q2 at one end, and £y_;Q2) at the other end, where d(-) is the typical 
log step size. It turns out that in the important cases where p(-) is either 
Gaussian or Laplacian, the range of validity of (35) is adequate, at least 
for the analysis of quantizers with up to 16 levels (N = 8). Further details 
may be found in Ref. 9. For both these distributions, we have found (35) 
to be an effective approximation in the range 1/3 < y < 2. For the former 
distribution, we have found good fits to be obtained if 


ay = 0.44 and ag = 0.34. 


(Below, we find it more convenient to express the rhs of (35) as a, logg 
yr a2.) 

With (35) as the sole approximation, in Appendix C we go through the 
involved and tedious process of approximating the bias function and 
thence deriving its root. The final result, however, is the following re- 
markably informative formula (Capp(c) is the approximate central log 
step size for signal intensity o): 


Capp(s) = S logge + D, (36) 
where 
1 
Ss = —— (37) 
; (1 — y){1 — 2a,(my — m,)}/7} 
: 1 — {1 — 2a;(my — m,)} 
and Oy N 1 


N-1 2 
my — 2 X (m,41 — M,)(a1é, + ae) 
r= 


BSS 38 
2a ;(my — Mm) i 
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Let us remark on certain features of the formula. Observe that, on 
account of a; being small, 1 — 2a;(my — m;) > 0 almost certainly; for 
example, a; = 0.018 when Q [see eq. (3)] is 1.1 and the input signal 
variables are Gaussian. Consequently, we observe, from the formula in 
(37) for the slope S, that S < 1. Now the ideal slope is 1 [see (34)|. Thus 
eq. (37) expresses the undesirable but expected fact, alluded to earlier 
in Section 1.1, that decreasing the leakage parameter vy has the effect of 
driving the load curve away from the ideal, as sketched in Fig. 8. 

As a digression, note that when y = 1, the slope S is unity. This is, of 
course, known to be the case.*> We may also compare the expression for 
S with a similar expression for ULA derived in Ref. 9—the two expres- 
sions are practically identical when y = @/ [eq. (20)] and @ is close to 
unity. This important fact, also confirmed in simulations in the example 
of Section IV, shows that in terms of the loading we expect the behavior 
in ILA and ULA to be roughly equivalent. 

One of the uses that formulas (36) to (38) can be put to is in the opti- 
mum choice of the multipliers. The approach we take is that y and (my 
— m}) are determined a priori on the basis of requirements arising from 
the quality of synchronization and transient response, respectively. This 
then fixes the value of S, eq. (87). However, there is still considerable 
freedom in the choice of the quantities (m,;; — m,),1<r<N-—1,and 
thereby in the choice of the value of D, eq. (88). This degree of freedom 
may be exploited to determine the point of intersection of the graph of 
Capp(o) and the ideal graph, which are shown in Fig. 8. A sensible choice 
for the point of intersection is at the signal intensity, o, that is most likely 
to be encountered. Usually,! this is at the midpoint of the range of signal 
intensities expected to be encountered in the application. 


IV. COMPUTED RESULTS 


Throughout this section, the input signal variables {x(-)} are inde- 
pendent, Gaussian, random variables with mean zero and standard de- 
viation o. The signal intensity o is varied about a central value of 1.0. 

The quantizer is a 16-level, uniform quantizer, i.e., N = 8, =r, 1 
r<N-1l,andn,=r-—%,1<r< WN. Throughout, the log base for the 
step sizes and multipliers, Q, is 1.1. 


IDEAL LOG STEP SIZE, (34) 







Cony (7), (36) 


LOGgd 


ear 8—The behavior of the central log step size compared to the ideal. See eqs. (34) and 
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For the uniform-leakage algorithm, ULA, we used as the leakage con- 
stant 8 = ®,. The multipliers for ULA are approximately those used by 
Rosenthal et al.!5 after correction, in the manner suggested in Ref. 1, for 
the following specifications: In the notation of Ref. 1, ¢ = midpoint of 
signal intensities = 1.0, the ideal loading factor = ideal step size/signal 
intensity = 0.257. This procedure gave the following values for the log- 
multipliers for ULA, 


m(1) = m(2) = m(8) = m(4) = —2.25; m(5) = m(6) = 2.50; 
m(7) = 7.25; m(8) = 11.50. (389) 


The multipliers used for the interleaved algorithm, ILA, were also 
selected to be those given above. We are aware of the advantages of fine 
tuning the multipliers and Q to take advantage of the special features 
of ILA, but decided on balance to keep the multipliers and Q unchanged. 
We found that, as it stands, the transient behavior for ILA is slightly 
superior to that of ULA; reducing Q in ILA equalizes the transient be- 
havior in the two schemes and yields s/n ratios slightly better than those 
reported here for ILA. 


4.1 Computed load curve, central log step sizes, and their approximation 


We illustrate the above notions for the interleaved leakage algorithm 
for the case of the interleaving interval, I = 16. We set y = B! = 0.777. 
Figure 7 plots three quantities with respect to logga: (i) d, the steady- 
state, mean log step size. This was obtained from 10,000 iterations; (ii) 
c(o), the central log step size defined in (31); (vii) Capp(o), the approxi- 
mate central log step size as given by (36) to (38). 

For the given specifications, 


Capp(o) = 0.99 logao — 13.20. 


To clarify Fig. 7, we have also tabulated in Table II the values of the 
above variables at seven values of o. 


Table Il — Computed load curve, central log step sizes, and their 
approximation (/ = 16 and y = 0.777) 


a, Signal intensity 0.05 0.10 0.3162 10 3.162 10.0 20.0 


d, steady state mean log step -—42.40 —35.53 —24.14 —12.46 —0.84 10.81 17.88 
size 

c(a), central log step size —44.35 —37.07 —25.00 —-12.93 —-0.85 11.24 18.51 

Capp(c), approximate central -—44.63 —37.36 —25.28 —13.20 —1.12 10.96 18.23 
jos step size 
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4.2 S/N ratios and load curve for ULA and ILA 


Table III compares signal-to-noise ratios for the two schemes for a 
variety of interleaving intervals. The signal energy is simply the energy 
of the variables {x (-)}. The noise is exactly the difference between the 
input signal variable and its reconstruction at the receiver, assuming 
error-free transmission. Thus, the reported s/n ratios reflect the effect 
of the step-size adaptation algorithms but do not measure synchroni- 
zation capabilities of the systems—the latter is measured separately in 
Section 4.3. 

Note the almost identical s/n ratio performance for the two algorithms, 
ULA and ILA. 

Tables IV and V compare the mean and standard deviations of the 
log step sizes. Again, note the uniformity of the results for the ULA and 
ILA; the loading characteristics of the two approaches are almost iden- 
tical. 


Table Ill — Signal-to-noise ratios (dB) 
ILA; ] = 2 ILA; / =4 LA;|J=8 ILA;J =16 
ULA y=R= y=6t= y= p8= y= Bib = 
o B = eq 0.969 0.939 7, 881 0.777 
0.10 14.89 14,92 14,90 14.70 14.16 
0.3162 14.55 14.57 14.56 14.48 14.17 
1.0 14.19 14.16 14.18 14.14 14.13 
3.162 13.80 13.77 13.63 13.84 13.76 
10.0 13.37 13.30 13.36 13.31 13.24 
Table IV — Steady-state mean log step sizes 
ILA; J = 2 ILA; ] = 4 ILA; I = 8 ILA; J = 16 
ULA Y= a= 1h = Vas (Ye BS 
G B = 836, 0.969 0.939 0.881 0.777 
0.10 —35.75 —35.78 —35.72 —35.66 —35.53 
0.3162 —24.12 —24.11 —24.10 —24.13 —24.14 
1.0 —12.47 —12.54 —12.47 ~12.54 —12.46 
3.162 —0.88 —0.90 —0.87 —0.82 —0.84 
10.0 10.74 10.81 10.84 10.78 10.81 


Table V— Standard deviation of log step size in steady state 


Bea ILA; ] = 4 Ce 8 me 16 

ULA = Be = y= B4 = = ps8 = = a = 
o B = 83,4 oO 969 0.939 "0. 881 "04 77 
0.10 4.48 4.50 4.57 4.75 5.26 
0.3162 4.56 4.63 4.64 4.74 4.97 
1.0 4,70 4.69 4.74 4.74 4.85 
3.162 4.80 4.80 4.81 4.81 4.80 
10.0 4.87 4.90 4.88 4.89 4.97 
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4.3 The steady-state mean offset in the transmitter and receiver log step 
sizes 


Here we present some computational resu!ts connected with the 
steady state, joint distribution of the transmitter and receiver log step 
sizes assuming, as we have done throughout Section IV, that the input 
signal variables are independent, normally distributed. 

The channel is assumed to be memoryless; further, the event that a 
transmitted “1” is received as a “0” and the event that a transmitted “0” 
is received as a “1” have the common probability p. Thus, p is the bit 
error probability. In the numerical results presented below, the following 
typical value for the bit error probability is assumed: p = 1074. 

Two geographically separated implementations of the interleaved 
leakage algorithm, (18), are assumed to be occurring: i = 0,/,2I, --- 
d(i+ 1) = yd(t) + m(i) 
d(ii+2)=dii+1)+m(it1) 


di+1)=d(iitI—-1)+mG+I-1) 
d’(i+1) = yd’(i) + m’(i) 
i+ VadGty+mG+y ie 
dith=aditIl—-1l)+m(Gi+I—-1)° 
The information regarding the log multipliers m(-) are assumed to be 
coded in the manner shown in Fig. 1 and transmitted through the 
channel described above. The log multipliers m’(-) are the log multipliers 
corresponding to the received code word. 
By the “steady state mean offset in the transmitter and receiver log 
step sizes” we mean the quantity @ where 


é = lim E{d(z) — d’(i)} (41) 
In Appendix D we show that é@ is given by the following expres- 
sion:9 


I N 
si (m, — ms)TsrPr, (42) 
di Y rs=1 





e= 


Table VI — Steady state mean offset in transmitter and receiver 
log step sizes. Bit error probability in channel, p = 1074 


ILA; I 2 2 ILA; I z 4 ILA; I = 8 ILA; J = 16 


ULA Bie ais bial es 7S S VSB 

g = 6364 0.969 0.939 0.881 0.777 
0.10 —0.025 —0.025 —0.025 —0.026 —0.026 
0.3162 —0.022 —0.022 —0.022 —0.023 —0.024 
1.0 —0.020 —0.020 —0.020 —0.021 —0.022 
3.162 —0.018 —0.018 —0.018 —0.018 —0.020 
10.0 —0.015 —0.015 ~0.015 —0.016 —0.017 
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where T = {T,,} is the channel transition matrix given below and p, is 
the steady state probability that the rth code word is transmitted (00- - -0 
is the first code word, 11-- -1 is the last, Nth, code word; the sign bit is 
ignored). 

The channel transition matrix T7' is defined thus: 


Tr = Pr [sth code word recd. |rth code word trans. |. 


In the special case where the codes are as shown in Fig. 1, the elements 
of the matrix are obtained in a simple manner from the Hamming dis- 
tance between the code words. Thus, if d(s,r) is the Hamming distance 
between the sth and rth code words, then 


Tee pena pyrene Lssra nN, (43) 


In the example under consideration where N = 8, T;; = (1 — p)3, Ti2 = 
p(1 — p)’, ete. 

The formula given in (42) for @, the mean offset in log step sizes, is 
extremely useful. To see this, recall that @ is defined in (41) in terms of 
the joint behavior of the transmitter and receiver in steady state, yet (42) 
provides the means for calculating @ provided only that the transmitter 
log step size distribution is known, since the quantities {p,} are statistics 
of the latter distribution. Thus, the considerably harder task of evalu- 
ating the joint distribution of the log step sizes at the two different sites 
is circumvented. 

Table VI enumerates the computed steady-state mean offset in 
transmitter and receiver log step sizes for various signal intensities and 
designs; note the almost identical performance. 


V. SUMMARY 


We consider it important that digitally implemented adaptive 
quantization systems possess two properties which, regardless of the 
statistics of the input signal, ensure that synchronization in the step-size 
adaptations at the transmitter and receiver is restored during periods 
of error-free transmission: The offset in step sizes is monotonic and 
nonincreasing and the step sizes differ in at most a finite number of 
sampling time instants. A detailed examination of the uniform-leakage 
algorithm (ULA) shows that a necessary and sufficient condition for the 
synchronization requirements to be satisfied is that the internal machine 
arithmetic, given by the nonlinear map f, possesses certain contraction 
properties. It is further shown that these contraction properties may exist 
only if the number of fractional bits (F) in the internal machine word 
is at least L where the leakage parameter @ is such that 1 — 2-2t+1< 8 
<1- 2-4“. Thus, if 8 = 1 — 2-4 then L is the number of fractional bits 
required for the binary representation of 8. We proceed to show that it 
is actually possible to obtain internal machine arithmetics which satisfy 
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all the requirements with the minimum possible number of fractional 
bits, i.e., F = L. The arithmetics that we obtain are moreover unique. 
With these arithmetics the offset in machine words during error-free 
transmission decreases exponentially fast to a value less than unity, after 
which there may be at most (24 — 1) occasions in which the step sizes 
differ. , 

We give a complete specification of the unique maps f. Thus, in the 
case where truncation is used to obtain the log step size from the internal 
machine word, the formula that generates f is: 

If o =k + j2-“, where k andj are integral and 0 < j < 24 — 1, then 


f(o) = k( — 2-4) + 2-4. 


Figure 3a is the graph of the map f for the example of L = 2. 

Even the minimum length of the machine words translate into large 
memory requirements in ROM-based implementations. Thus, in the 
fairly typical case where the total number of step sizes is 64 and the 
leakage parameter 8 = ®34, we find that the minimum word length is 12 
bits, which translates into a ROM size of about 50K bits. 

We propose a new adaptation algorithm which is considerably more 
efficient in terms of the memory used in the implementation. In this 
algorithm, ILA, leakage is interleaved infrequently but at regular inter- 
vals into the recursion for the step-size adaptation. Thus, this scheme 
has as parameters y, the leakage parameter, and J, the interleaving in- 
terval. We find that, for comparable synchronization capabilities in ULA 
and ILA, the parameters are related thus: 


yl = B. 


Thus for 8 close to unity, y ~ 1 — J(1 — 6). Table I shows that for given 
6 the fractional bits required for the binary representation of the 
equivalent value of y is reduced by an additional bit for every doubling 
of the interleaving interval. 

To illustrate, consider the example given above where 8.= ®%4; the 
new scheme provides the option of interleaving leakage once in 8 itera- 
tions (J = 8) with a leakage parameter y ~ “, which has three fractional 
bits. Thus, for the same total number of step sizes, the total word length 
required is 9 bits, which translates into an ROM size of about 5K bits and 
an order-of-magnitude reduction in memory size. Furthermore, the es- 
sential costly element of the system, the ROM, is used only once in 8 it- 
erations, thus allowing for the additional multiplexing of the ROM. 

The internal machine arithmetic that is proposed for ILA is identical 
to that specified for ULA, except that the machine word in the former 
system is of shorter length. 

A detailed theoretical analysis of the statistical behavior of the step 
sizes for independent random inputs is undertaken. Perhaps the most 
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insightful result obtained is a simple formula giving the approximate 
dependence on the input signal intensity, o, of the central log step size, 
c(o), which is the particular log step size about which the distribution 
of log step sizes is concentrated. The formula depends on only two pa- 
rameters, a; and ao, of the input signal distribution; in the case of 
Gaussian input distributions, a; ~ 0.44 log Q and a, ~ 0.34. This simple 
formula is given in (36) to (38). 

The idealized adaptation algorithms were simulated for a represen- 
tative 16-level quantizer and independent, Gaussian inputs. In the 
simulations, the multipliers in ILA were selected to be identical to those 
used in ULA, although in general we expect the optimal multipliers to 
be different for the two schemes. The results of the simulations show 
that the performances of the systems are almost identical. 


APPENDIX A 
Specification of the Machine Arithmetics 


We describe first the maps f corresponding to the truncating external 
arithmetic in (22b) which satisfy conditions (1) to (iii) given in (23) to 
(25), Section 2.2. In the example shown in Fig. 3a, observe that the 
breaks, i.e., zero slope segments between pairs of points, occur just prior 
to the integral values of o. This is also the rule by which f is obtained for 
general values of L. 

The following formula generates f for general values of L: 


Ifo =k+ j2-4,k andj integral and0 <j < 24-1, (44) 


then f(o) = k(1 — 2-4) + j2-4. 
Condition (1), (23), is trivially verified. For condition (ii), (24), note 
that for all integral k 
f(R+ 1-2-4) -—f(k +1) =0. (45) 


Thus a strong contraction across integer boundaries exists and, in fact, 
for o, and o2 with different integer parts 


[for) = flo)| . _2y 


lor—on] +7 oe 
so that we may take 
6=2y/(1+y) <1. (47) 
For the final condition (iii), we find that 
0<f(o) — yo < 2-4(1 — 2-4), (48) 


where the two inequalities become equalities at c= k anda =k — 274, 
respectively, whenever & is integral. 
We can also show rather easily that the map f given by (44) is unique, 
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i.e., there does not exist any other map satisfying the requirements (i) 
to (iit). Uniqueness follows from the following two reasons: (a) Condition 
(it) requires that there be a break in the graph of f between o = k — 2-4 
and o =k, k integral, i.e., f(k — 2-4) = f(k). Reason: o = k — 2-/ and 
o = k have different integer parts. (b) In order to satisfy at once both the 
fidelity condition (tit) and the weak contraction (1) there can be at most 
one break in the typical integer interval [k, k + 1]. 

We now describe the slightly different map f which is obtained for the 
rounding external arithmetic, (22a). For the requirements on f, the only 
difference is in condition (ii) which now reads as follows: 


(ti’) oye(k -- Yo,k + Yo] = (a1) — f(oe)| 
ook + Yok + %] |o1 — o9| 


The graph of f shown in Fig. 3b is obviously similar to the one displayed 
in Fig. 3a, the main difference being the locations of the breaks which 
are here positioned immediately following the midpoint of the integer 
intervals. 

We rapidly summarize the key features of f. The formula for gener- 
ating f for general L is: 
Ifo=k—%+ 2-4, k andj integral, 1 <j < 24, 


then f(o) =k(1— 2-4) — + j2-#. (44’) 


<6<1. (24’) 


The weak contraction condition (i) is trivially satisfied as well as the 
strong contraction condition (ii’), (24’), with the same value of 6 that was 
previously obtained: 


6=2y/(1+y) <1. (47’) 
Finally, 
[f(c) + yo|:= 2-474, (48’) 


and hence condition (iz) is also satisfied. It is noteworthy that in keeping 
with the familiar properties of rounding and truncating, the above error 
bound is generally smaller than the corresponding bound in (48) for the 
truncating external arithmetic. . 

The arguments used previously for establishing uniqueness apply as 
well for the above construction. 


APPENDIX B 
On the Bias Function 

We give here the derivations of the results on the bias function that 
are stated in Section 3.1, accompanied by more detailed insights and 
interpretations. It is convenient to drop the adjunct o in B(-|c), the bias 
function, with the understanding that here o is arbitrary, but fixed. 
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B.1 Generating the bias function 


We derive (33), which is a functional recursion yielding the bias 
function, 


B(d) = E[d(i)|d(O) = d] — d. (49) 
Define the ancillary functions 
B,(d) + E[d()|dU —k) =d]-—d,0sk <I, (50) 
so that 
B(d) = B;(d). 
Observe that 


E[d()|d(U —k) = d] = © sPrld(J) = s|dU —k) =] 
= ¥ Prld(I —k + 1) =t|d(—k) =d] 
t 


x Eld(I)|dU —k+1)=t], (51) 


where the Markov property has been used to obtain (51). Now ¢ can take 
only N possible values. In fact, from (18), we see that if k < I, then te{d 
+ m,|r = 1,-++,N}, and if k = I then te{yd + m,|r = 1, --++,N}. Further, 
the respective probabilities are easily given in terms of the functionals 
b-(y), 1 <r <N, defined in (382), of the common pdf of the input signal 
variables. Thus, 


b,(d) = Prlé--19¢ S |x()| < &Q7] 
Prid(i-k+1)=d+m,|d(I —k) =d], 


jeer Slot (52) 
Prd —k+1)=yd+m,|dU —k) = d], 
k=]. 
Substituting in (51), we arrive at the relations 
E|d(I)|dU — k) = d] 
N 
>~ 6,(a)E[dd)|dd —k+1)=d+m,], 1<k<I-1 
_ Jr=l 
Vn 
>~ b,(d)E[d)|dI —~k+1)=yd+m,], R=. (53) 
r=1 


Substituting in the expressions in (50) for the functions B; (-), we obtain 
the recursive formula given in the main text: 


Bo(d) = 0, 
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S bdibe mand Aer aa (54) 

B,(d)= 4"— ‘3 
-—(1-y)d+ ¥ b,(d){B,-1(yd + m,) +m,}, k=I, (55) 

r=1 


and B(d) = B;(d). 


B.2 The range of the bias function 
Note that, as d — —~, the values of all the probabilities b,(d), ---, 
bn—1(d) approach 0, while by(d) — 1. Similarly, as d — ~, the values 
of all the probabilities bo(d),---,by(d) approach 0, while b;(d) — 1. 
Thus, from (54) we have that 
As d > —~, B,(d) > mn, and asd —> ~, B,(d) > m,. (56) 
Iterating, we obtain that 
As d > —, B;-;(d) > (I — 1)my, 
asd > o, By_1(d) ae (I = 1)m,. (57) 
Finally, for the bias function we obtain from the above and (55) that 
d—-> —o, B(d)  -—(1— y)d + Im, > 0 
d — o, ~ —(1—y)d +Imy <0. (58) 


The above is the basis for the claim that at least one zero-crossing of the 
bias function is guaranteed from observing the values of the function 
at the two limits. 


B.3 The monotonicity of the bias function 


We establish here sufficient conditions which imply the rather im- 
portant monotonicity property of the bias function. Equations (54) and 
(55) provide the working definition of the bias function. Observe from 
(54) that forl <k <J-1, 


Bi (d) = b’(d)|By_a(d + m,) + m,} + 5 b(d)By-a(d + m,) 
r= r=1 


N-1 
=— > F,(d){By_-1(d + m,+1) — Bp-i(d + m,) + mp41 — m,} + ”. 
r=1 


(59) 
We have found it convenient to introduce 
F.(d)* > b,(d), 15rN. (60) 
s=1 


The reason for this is that F(d) is positive since 


Fed)=2 f° oe sidu (61) 


At this point, it is worth noting from (59) that B,_, < 0 is not enough 


2756 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1978 


to establish that B), < 0; it is necessary in addition that B),_, be not ex- 
cessively negative. This motivates the bounding of the derivative of B,—1 
from both below and above. We therefore introduce the quantities 


a, SminB,(y); max B,(y) S Ba, (62) 
y y 
where it is understood that we are only interested in y having values in 
the finite dynamic range of the log step size. Further, let 


N=-1_, 
d(d) = X F.(d)(m-+1 — m,) (63) 

and 
0 < bmin S 5(d) S Snax: (64) 


From (59) we obtain 
B,(d) S —6(d)(ap—1 + 1) + Be-1 
< —dmin(Qk~1 -- 1) = i Br-1; assuming Qp-1 BS —1. 


Thus, we may take 


Be = —Smin(ag-1 + 1) + Be-1, (65) 
provided az; < —1. In identical fashion, we also obtain 
a = —§max(Be—1 + 1) + ap-1, (66) 


again assuming a,z_; S —1. 

Summarizing, we have at this stage a coupled pair of recursions for 
the upper and lower bounds on the derivatives of the functions B;, 1 < 
k <I-—1, provided a,-; = —1,1<k <J —1. Finally, we also have from 
(55) that 


B’(d) = By(d) S (1 — y) — 6min(a@r-1 + 1) + YBy-1. (67) 


We may now solve the linear recursions in (65) and (66) for (az, Bz) 
with the initial conditions ap = Bp = 0. The following solution is obtained: 
1<k <1, 


— 
~ 
3 
i) 
a 





ay = 5a + OPO) ee LO) (Lo) ad. (68) 


2 0 





Br =F +544 (1 - 54-5. -{(1+6)% — (1 —6)*}-—1. (69) 


Omax 


We have denoted by 6 the geometric mean of dmax and dyin, ie., 
5 me (Ompadmay) (70) 


The reader will recall that the recursions (65) and (66) were contingent 
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upon a,-1 = —1. We find, upon examining the “solutions,” that we can 
ensure its validity over the range 1 < k < I — 1 provided aj-; 2 —1, 
1.e., 

(1+ 6)!-1+ (1 — 6)/-1 


dmax 26° = ——., 
. (1 + 6)/-1 — (1 — 6)! 


(71) 


The above is a key relation. The first observation on it is that the 
relation implies not only that a;_; = —1 but also that 6;_-, < 0, which 
is of primary interest. This may be verified either directly from the ex- 
pression in (69) or, more conveniently, from the recursion in (65) for 8; 
and the fact that 8) = 0. But, as an examination for the bound on B’(d) 
in (67) shows, these two conclusions, namely, az; 2 —1 and 6;-; < 0, 
are sufficient to guarantee that B’(d) < 0. We have thus arrived at the 
main result of this section: 


If dmax Satisfies the inequality (71), then B’(d) <0. (72) 


Some insight into the nature of the inequality (71) may be gained by 
considering the case of 5 < 1. In this case, the rhs of (71) reduces to 1/(I 
— 1). Further, we observe from (68) and (69) that a, ~ —k6bmax and B; 
=~ —k6b min. Thus, summarizing, we have that 


Ifd<«i1thena, ~ —kimax, Ber¥—Rimns ISkRSI-1 
and (71) requires that 6max <1/(I-—1). (738) 


Thus, we have demonstrated that the monotonicity of the bias function 
is implied if the quantity 6(d) defined in (63) is uniformly small. 

Let us now examine the probabilistic import of the condition in (71), 
namely, that 


6(d) = =F .(d)(mp+1 4 m,) 
be not large. First, recall from the definition of F,.(d) in (61) that 


F,(d) = 2(In Q)(E,Q4)p (E,Q%), laren =. (74) 
Thus, 


N- 
8(d) = 2(n Q) 7m (mp1 — m,)(E-Q*)p (E,Q2) 


N-1 
=2 2 In (M,+1/M,)(E-Q%)p(E-Q%). (75) 
Requiring that 5(d) be not too large is tantamount to requiring that the 


ratios of the multipliers, M,41,/M,, be not too large. To make this con- 
nection quite transparent, we see that 


6(d) < 2In (My/M;) | max yp) | (76) 
y 
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For p(-) Gaussian with variance o2, observe that 


max yp(y) = p(o) = 0.242, (77) 
y 


so that, in this case, (76) states that 
6(d) < 0.484 In (Myn/M)). (78) 


The above is not a particularly good bound, relative to the expression 
in (75), but it does illuminate the manner in which 6,,,, depends on the 
ratios of the multipliers. 

Finally, in summary let us recall in purely qualitative terms the reasons 
for requiring that 6(d) = =F,(d)(m,+1 — m,) be not large. This condition 
is tied in a natural way to the conditions that B,(d) =>—-1,1<k<I- 
1, which is at the core of the above analysis since it follows rather easily 
from these conditions that B;,(d) < 0, also. The conditions “B;,(y) = —1” 
have an entirely natural, underlying probabilistic interpretation. It 
merely states that, for two starting log step sizes, d(0) = d and d(0) = 
d’, where, say, the ordering is d < d’, the respective expected log step sizes 
after k iterations should also be ordered in the same way. A little thought 
is enough to convince one that such a condition can only be guaranteed 
by requiring that 6(d) be not too large, since 6(d) itself measures the 
potential for initial orderings to be reversed in one iteration. 


APPENDIX C 
Approximate Formula for the Central Log Step Sizes 


The object here is to derive the following approximate formula for the - 
dependence of the central log step size on the signal intensity, o: 


Capp(o) = S logge + D, : (79) 


where S and D, given in (37) and (38), are obtained from the fixed pa- 
rameters of the system. The sole approximation that is made is in ap- 
proximating the distribution of the input signal variables in the following 
manner: 


y 
f, P(u)du © a logay + ae, (80) 


where p(-) is the pdf of the input signal variables normalized to have unit 
variance. 

The procedure that is followed consists of first deriving the approxi- 
mation to the bias function, using the recursive formula in (33), and 
subsequently deriving the root of the approximate bias function. Observe 
that the recursive formula in (33) calls for the quantities b,(-),1 <r< 
N. We find it essential to work with the partial sums 
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F.(d)= > 0,(d), 1<r<N-1 
=1 


£,Q¢ 
=2 f, p(u)du 


~ Qa, loga(é-Q?¢/a) + 2ae, from (80), 
= 2a,d — 2a, logge + (2az + 2a4£,), (81) 
where o” is the variance of the input signal variables. Note that 
Fn(d) = 1. 
Examining (33), we find that we may also write it as follows [for no- 


tational simplicity, we drop the adjunct o in B;(d|o)|: 
forl<sk<I-1 


Ce Cea ne = F,(d){By-1(d + mp4) 


= Bp-1(d + my) + M;41 — mr}. (82) 
Now suppose that B,-1(d) may be expressed in the form 
Bp-1(d) = (fp-1 — 1)d + gr-1 logge + he-1, (83) 
where (fp—1, Zk—-1, Ne-1) do not depend on either d or o. Certainly, Bo(d) 
may be expressed in this form since Bo(d) = 0. We now show that B; (d) 
may also be expressed in the above manner. 
Upon substituting the above expression for B;,—1(d) and the expression 
in (81) for F,(d), in (82) we find that 
Bz(d) = (fx — 1)d + gp logge + hz, (84) 
where 
fr = {1 — 2a;(my — my)}fp-1, 
8k = Br-1 + 204(mn — M1)fe-1, 
—1 
hy = haa + {my ~ 2° (amar — me N(eak, + a9) fe (85) 
Certainly, the newly defined quantities are independent of d and loggs. 
Thus, the basis exists for an inductive construction. Further, the coupled 
recursions in (85) are trivial to solve for the initial conditions fo = 1, go 
= 0, ho = 0; thus, we obtain (f7-1, 27-1, h7-1). 


As is apparent from (23), the final iteration in the recursion for gen- 
erating the bias function differs from all the others. In fact, 


fr = ty — 20;(my — m,)}f1-1 
81 = 81-1 + 2a,(my — m,)f 1-1 
N-1 = 
hy = brat {mw ~ 2 E (meer — marks + a2)} fra (86) 
The complete solution for the approximation to the bias function 


is: 
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B(d) = (fr — 1)d + g; logge + hy, (87) 
where 
fr = —(1 — y){l — 2ay(my — m,)}F71 + {1 — 2ay(my — my)}, (88) 
gr = 1—{1 — 2a,;(my — m)}, 


N- ne 
my — 2 (m-+1— m,) (ay, + a) | {1 — {1 — 2a)(my — m,)}4} 


hy = 
2a1(my — Mm) 


Recall that the central log step size is the root of the bias function B(d). 
Thus, denoting by ¢app(c) the root of the function in (87), we obtain 


&1 hy 
—— lo + —— (89) 
1= fp ee =f 
= S logga + D, (90) 


where S and D, trivially identified by comparing the two expressions, 
are as given in the main text, (37) and (88). 


Capp(o) = 


APPENDIX D 
Formula for the Steady State, Mean Offset in Transmitter and Receiver Log 
Step Sizes 


We derive the formula for é given in (42). First, it is necessary to define 
certain quantities in connection with (40), which describes the step-size 
adaptations at the two sites. 

e(-) = d(-) —d’(-),_ the offset at time -, (91) 
and u(-) = m(-) — m’(-), the offset in the log multipliers at time -. From 
(40) we obtain 
e(i +1) = ye(t) + u(t) 
e(it 2) =e(i+1)+u(it+1) 1=0,J,2I,---. (92) 


eit+l)=e(-I-1)+uGi+J—1) 
Thus, 

e(ut+TJ) = ye(t) + {u@) tu@it+1)+---+u@itI—1)}. (93) 
Taking expectations of both sides of the equation, 

e(u+J) = ye(i) + {u(i) tu +1) +---+uG+7—1)}, (94) 


where the bar has been used to denote mean values. 

Consider u(z), the first term inside the parentheses. Observe that 
u(-)e(m, — m,|1 < r,s < N}. Also, 
| rth code word transmitted and sth 


N 
u(i) = m, —m,) Pr 
@) AG s) code word received at time 1. 


r,s=1 
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N 
= DY (m, — ms) Pr 


r,s=1 


ha code word recd.| rth code | 


word trans. 





= > (m,-—ms)Ts-p;(t), (95) 


r,s=1 


rth code word trans. N 
xX Pr : : 
at time i 


where T',, is simply the (s,r)th element of the channel transition matrix, 
and p-(t), 1 <r < N, is simply obtained from the pdf of the transmitter 
log step size at time 1. 

Expressions for @(i + 1),--+,u(i + I — 1) may similarly be derived. 
Thus, fori = 0,/,2/,--- 


nau =e > asa ie) 


r,s=1 


+---+p,(i+I—1)}. (96) 


To proceed further, it is necessary to assume ergodicity, i.e., more 
specifically, convergence in the mean for the time-evolving distributions 
of the transmitter log step size. With this assumption, asi — ~ 


é(i) > e (97) 

and 
pr(i) +++++ p,-(i +I — 1) > Ip,, 1<r<N, (98) 
where é and p; have the interpretations mentioned in the main text. 


Substituting in (94) and (96) yields 


I N 
> (m, ms)T srPr, (32) 
ea ( r,s=1 


which is what we set out to establish. 





e= 
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